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7A  procedural  theory  of  judgment  is  described  in  which  judgment  Is  viewed 
as  a  serial  "anchoring  and  adjustment"  process.  The  process  is  described 
as  comprising  scanning,  anchoring,  and  adjusting  operations,  the  latter  of 
which  is  applied  iteratively  (with  order  of  adjustment  steps  usually  deter¬ 
mined  by  relative  Importance)  until  the  judge  deems  that  sufficient  infor¬ 
mation  has  been  integrated  and  outputs  a  final  response.  The  paper  has 
three  major  sections.  First,  the  procedural  theory  is  described  and 
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Toward  a  Procedural  Theory  of  Judgment 
Prologue 

A  few  years  ago  I  spent  several  months  trying  to  interest  a  new  graduate 
student  in  research  on  Judgment,  but  I  could  not  seem  to  arouse  any  enthusiasm. 
Finally,  in  desperation,  I  sent  him  off  to  the  laboratory  to  run  some  subjects 
on  a  pilot  project.  A  few  weeks  later  he  was  back  with  a  sheaf  of  computer 
output.  "l  want  to  show  you  something  interesting,"  he  said.  He  pointed  to 
a  sheet  on  which  a  single  subject's  data  were  laid  out  as  a  row-by-column 
array  with  the  responses  from  the  several  replications  listed  one  after  another 
within  the  cells.  "Look  here,"  he  said,  "everytime  you  give  the  subject  the 
same  stimulus  he  gives  almost  the  identical  answer.  And  almost  all  the  subjects 
do  the  same." 

This  student's  revelation  had,  for  me,  a  bittersweet  quality,  for  he  had 
"discovered"  what  all  of  us  who  have  studied  judgment  seriously  take  as  given: 
human  beings,  in  making  quantitative  judgments  about  complex  stimuli,  do  so 
in  a  manner  that  is  systematic  and  replicable.  Furthermore,  their  judgments 
often  reveal  algebraic  patterns  that  suggest  the  operation  of  some  internal 
mechanism  for  "computing"  averages,  products,  relative  ratios  and  so  forth. 
Subjects,  however,  are  not  conscious  of  these  computations  nor  are  they  aware 
of  the  algebraic  form  of  their  judgments.  Where  do  the  judgments  come  from? 

What  psychological  processes  give  rise  to  this  algebra-less  algebra?  These 
are  questions  whose  interest  has  eluded  not  only  that  particular  graduate 
student,  but  also  a  number  of  professional  researchers  who  have  perfunctorily 
crltlzed  algebraic  judgment  models  for  their  "as  if"  status  and  then  moved  on 
to  study  other  topics. 

Strictly  speaking,  the  present  paper  is  a  final  report  on  a  research 
project  that  has  attempted  to  show  why  subjects  in  a  particular  task  —  the 
Bayesian  inference  task  —  produce  judgments  that  are  more  like  averages  than 
like  inferences.  The  project,  however,  was  based  on  a  general  procedural 
theory  of  judgment  that  applies  not  only  to  averaging,  but  to  other  algebraic 
forms  as  well.  Thus,  the  broader  purpose  of  the  present  report  is  to  lay  out 
the  rudiments  of  this  procedural  theory  more  generally  than  has  been  possible 
in  the  previously  published  experimental  reports  (Lopes,  1981,  1982a, b). 
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The  report  will  be  divided  Into  three  sections.  First,  I  v.  11  describe 
the  procedural  theory  and  contr.ast  it  with  the  implicit  characterization  of 
process  that  might  be  drawn  from  algebraic  models  of  judgment.  Second,  I  will 
describe  three  common  forms  of  judgment  (averaging  rules,  relative  ratio  rules, 
and  multiplying  rules)  and  will  discuss  the  factors  that  predispose  subjects  to 
use  these  rules  for  particular  tasks.  Third  and  finally,  1  will  show  how 
several  different  phenomena  of  judgment  (primacy  and  recency  effects,  initial 
impression  effects,  differential  weighting  effects,  and  violations  of  additivity) 
can  be  interpreted  within  the  procedural  theory. 

A  Procedural  Theory  of  Judgment 

In  the  present  view,  judgments  are  produced  via  a  serial  adjustment  process 
in  which  an  initial  or  "anchor"  quantity  is  "adjusted"  one  or  more  times  in 
accordance  with  other  available  information.  The  idea  that  judgment  is  a  pro¬ 
cess  of  serially  adjusting  an  Internal  quantity  was  first  suggested  by  Slovic 
(1967)  under  the  name  "polarization  and  adjustment"  and  later  named  "anchoring 
and  adjustment"  by  Tversky  and  Kahneman  (197A)  .  More  recently,  the  process  has 
been  hypothesized  to  account  for  averaging-like  results  in  Bayesian  inference 
(Lopes,  1981,  1982a, b;  Wallsten,  1976b),  diagnostic  inference  (Einhorn  &  Hogarth, 
1982),  and  similarity  judgment  (Lopes  &  Johnson,  1982;  Lopes  6  Oden,  1980). 

The  present  paper  extends  these  earlier  treatments  by  showing  how  serial  adjust¬ 
ment  processes  mediate  the  algebraic  form  of  the  judgments  produced. 

A  flow  diagram  of  the  proposed  process  is  given  in  Figure  1.  The  process 
is  described  as  comprising  a  set  of  operations  in  which  information  is  scanned, 
items  are  selected  for  processing,  scale  values  are  assessed,  and  adjustments 
are  made  (at  least  after  the  first  step)  to  an  interim  quantity  that  summarizes 
the  results  from  already-processed  information.  These  various  operations  are 
described  in  detail  below. 

Scann Ing .  In  the  scanning  operation  the  judge  assesses  what  information 
has  been  presented  for  judgment.  The  details  of  this  stage  depend,  of  course, 
on  how  the  Information  is  presented  to  the  judge.  Three  cases  can  be  distin¬ 
guished.  In  sequential  presentation  the  items  of  information  are  presented  one 
at  a  time,  with  or  without  an  overt  response  from  the  judge  after  each  item. 
Obviously,  the  scanning  stage  is  rudimentary  In  sequential  presentation  since 
at  any  moment  there  is  only  one  item  to  scan. 
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Figure  1.  Flow  diagram  of  serial  adjustment  judgment  model. 


In  und  ifforcntlated  s imu 1 1 aneuus  prosentat ion  several  items  of  Information 
are  presented  it  the  same  time  and  there  is  no  a  priori  reason  to  suppose  that 
any  particular  item  is  more  important  than  any  other.  In  such  a  case,  if  tlie 
number  of  items  presented  is  small,  sc-anning  will  probably  include  all  available- 
items,  with  the  order  of  scanning  determined  by  the  physical  format  of  the 
stimulus;  i.e.,  if  the  information  is  presented  as  a  list,  scanning  will  probably 
proceed  from  top  to  bottom.  If  the  number  of  items  of  information  is  large, 
however,  scanning  may  terminate  before  all  available  items  have  been  scanned. 

In  differentiated  simultaneous  presentation  several  items  of  information 
are  presented  at  once  but  items  differ  from  one  another  in  terms  of  the  kind  of 
information  they  contain  or  in  terms  of  their  relevance  for  the  judgment  task. 

One  example  might  be  an  application  for  graduate  school  containing  an  applicant's 
GTA,  GRE  scores,  undergraduate  college,  letters  of  reference,  personal  history, 
and  reasons  for  wanting  to  pursue  advanced  study.  Another  example  might  be  an 
advertisement  for  an  apartment  giving  rent,  locale,  number  of  rooms,  and  so  forth. 
IVith  complicated  materials  such  as  these  the  scanning  process  will  undoubtedly 
reflect  the  judge's  perception  of  the  relative  importance  or  usefulness  of  the 
different  types  of  information.  Some  items  may  reliably  be  scanned  before  other 
items  (e.g.,  GPA  and  GRE  scores  may  be  scanned  first  in  a  graduate  school  appli¬ 
cation)  and  some  items  may  not  be  scanned  at  all,  or  at  least  not  on  the  initial 
pass  (e.g.,  hobbles  or  reasons  for  wanting  to  attend  graduate  school). 

It  is  assumed  that  the  primary  function  of  the  scanning  operation  is  to 
orient  the  judge  to  the  information  available  in  the  stimulus.  Although  some 
preliminary  evaluation  and  integration  of  the  stimulus  material  may  occur  during 
the  stage,  this  would  give  only  a  rough  impression  of  the  value  of  the  stimulus. 
Such  a  rough  impression  would  ordinarily  not  be  the  final  response,  since  that 
would  be  developed  as  the  result  of  later  and,  usually,  more  deliberate  passes 
through  the  stimulus  Information. 

There  may,  of  course,  be  exceptions  to  this  rule.  For  example,  in  scanning 
applications  for  graduate  school,  a  judge  may  come  across  one  that  is  deficient 
in  every  respect.  If  a  very  crude  scale  of  judgment  is  being  used  (e.g., 
assigning  the  applicant  to  one  of  five  categories)  the  judge  may  assign  the 
application  to  the  lowest  category  without  further  procer-sing.  If,  however,  the 
judge  is  using  a  more  continuous  scale  of  judgment  (e.g..  rating  the  applicant's 
probability  of  success  in  graduate  school)  and  if,  as  is  usually  the  case  in 
experimental  tasks,  the  judge  has  been  cautioned  against  making  hasty  "end 
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responses,"  then  further  processing  would  be  required  In  order  to  position  the 
stimulus  reliably  on  the  judgment  scale. 

Anchoring .  After  the  stimulus  information  has  been  scanned,  the  judge 
chooses  some  item  of  information  to  serve  as  a  quantitative  "anchor"  (Tversky 
&  Kahneman,  197A)  for  the  judgment  process.  If  the  Information  is  presented 
sequentially,  of  course,  only  one  item  is  available  at  a  time  so  the  judge  must 
use  the  first  item  presented  as  the  anchor.  If  the  information  is  presented 
simultaneously,  the  Judge  has  several  items  to  choose  from.  For  items  that  are 
relatively  undifferentiated  (e.g.,  a  list  of  trait  adjectives  in  an  impression 
formation  task)  the  judge  may  simply  anchor  on  whatever  adjective  happens  to 
be  on  top  of  the  list.  For  items  that  are  differentiated,  however,  the  judge 
will  ordinarily  select  an  anchor  on  the  basis  of  the  a  priori  importance  of 
the  information  typo.  For  instance,  GRE  scores  or  GPA  might  be  chosen  to  anchor 
judgments  of  graduate  school  applicants.  Alternatively,  an  item  might  be  chosen 
because  of  the  unusual  dlagnosticity  of  its  particular  information.  For  example, 
prior  work  experience  would  probably  not  be  considered  very  important  for  judging 
graduate  school  applicants.  But  experience  as  a  lion  tamer  might  be  considered 
interesting  enough  to  use  as  an  anchor  for  someone  applying  for  study  in  animal 
behavior  (or  risky  decision  making). 

As  will  be  seen,  the  anchoring  operation  can  have  important  consequences 
for  the  judgment  process.  As  Tversky  and  Kahneman  (197A)  pointed  out,  adjustments 
to  the  anchor  quantity  are  often  "insufficient"  in  the  sense  that  final  judgments 
often  lie  nearer  to  the  anchor  than  they  would  if  the  process  were  independent  of 
which  stimulus  served  as  the  anchor.  Thus,  anchoring  and  adjustment  can  lead  to 
judgmental  primacy  which,  in  turn,  can  affect  the  algebraic  pattern  of  the  judg¬ 
ments  produced.  These  anchoring  effects  will  be  discussed  more  fully  in  the  sec¬ 
tions  on  primacy/recency  and  on  differential  weighting. 

Valuation .  Once  an  anchor  stimulus  has  been  chosen,  it  must  be  evaluated 
relative  to  the  scale  of  judgment.  In  some  cases  this  valuation  operation  may 
yield  a  quantity  that  serves  directly  as  the  anchor  value.  For  example,  in  a 
Bayesian  inference  task,  subjects  may  simply  anchor  their  judgment  at  the  scale 
position  that  is  proportional  to  the  number  of  "target"  items  in  the  initial 
sample  (Lopes,  1981) ,  In  other  cases,  however,  the  Initial  judgment  may  be  less 
extreme  than  the  scale  value  of  the  anchor  stimulus.  In  these  cases  the  initial 
judgment  appears  to  be  a  compromise  between  the  value  of  the  stimulus  information 
and  some  internal,  presumably  more  neutral  "initial  impression"  (Anderson,  1967). 
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As  can  be  seen  from  the  figure,  stimulus  valuation  occurs  not  only  In  the 
course  of  anchoring  the  Judgment,  but  also  in  subsequent  stages  as  other  stimulus 
information  is  brought  into  the  judgment.  This  valuation  process  can  be  simple 
or  complex  depending  on  the  nature  of  the  judgment  task.  In  some  cases,  the 
stimulus  information  seems  to  "contain"  the  pertinent  value  information  fairly 
directly.  This  is  seen  most  easily  in  the  impression  formation  task  in  which 
the  given  personality  trait  adjectives  seem  to  contain  the  required  "likeableness" 
values  as  part  of  their  meaning.  In  other  cases,  however,  inference  must  be  done 
to  determine  the  scale  value  of  the  stimulus  information.  This  would  be  true, 
for  example,  in  a  task  in  which  subjects  were  asked  to  use  aptitude  information, 
to  judge  the  proficiency  of  people  in  various  occupations  (e.g.,  Anderson  &  Lopes, 
1974) . 

In  many  judgment  tasks  the  valuation  process  is  complete  when  the  scale 
value  of  the  currently  attended  stimulus  has  been  obtained.  In  some  situations, 
however,  the  obtained  scale  value  is  processed  further  by  comparing  it  with  some 
expectancy  or  with  the  values  of  previous  stimuli.  For  example,  Lopes  (1972) 
showed  how  some  striking  contrast  effects  obtained  by  Jones,  Worchel ,  Coethals, 
and  Grumet  (1971)  in  an  attitude  attribution  task  could  be  explained  without 
recourse  to  notions  of  stimulus  interaction.  It  was  proposed  that  subjects 
evaluated  the  content  of  an  essay  purportedly  written  by  the  target  person  against 
the  sort  of  essay  such  a  target  person  might  have  been  expected  to  write.  The 
discrepancy  between  the  actual  and  the  obtained  essay  was  then  integrated  into  the 
judgment  along  with  the  value  of  the  essay  itself. 

In  a  similar  vein,  some  subjects  in  a  goal-setting  task  (Lopes,  1976)  seemed 
to  set  their  goal  for  the  next  trial  in  a  maze-running  problem  in  accordance 
both  with  their  scores  from  the  previous  few  trials  and  with  the  discrepancy 
between  their  most  recent  score  and  their  next-most-recent  score.  For  these 
subjects,  it  was  as  though  they  evaluated  their  performance  on  each  trial  not  only 
In  terms  of  the  score  itself,  but  also  in  terms  of  whether  or  not  they  showed 
improvement  on  the  task. 

Adjustment .  Once  anchoring  has  been  accomplished,  the  judge  must  decide 
whether  there  are  still  important  items  unprocessed.  If  so,  the  process  essen¬ 
tially  reiterates,  with  the  judge  choosing  which  of  the  remaining  items  to 
consider  next.  As  can  be  seen  In  the  figure,  the  considerations  at  this  point 
are  exactly  what  they  were  at  the  time  of  choosing  the  anchor;  if  one  of  the 
remaining  items  is  clearly  more  important  than  the  others,  the  subject  chooses 
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It,  otherwise  an  item  Is  chosen  arbitrarily  and  the  scale  value  of  the  chosen 
Item  is  then  determined. 

The  next  step  In  the  judgment  process  is  adjustment  of  the  initial  value 
according  to  the  new  information.  It  Is  this  step  that  is  most  crucial  in  deter¬ 
mining  whether  the  algebraic  form  of  the  judgment  is  an  average,  product,  rela¬ 
tive  ratio,  or  other  combination  rule.  Detailed  discussion  of  how  the  adjustment 
process  is  related  to  the  algebraic  form  of  the  judgment  will  be  deferrea  to  the 
section  on  algebraic  rules.  For  the  moment  it  suffices  to  say  that  the  adjustment 
process  has  two  Important  subprocesses:  choice  of  the  direction  in  which  adjust¬ 
ment  will  occur  and  choice  of  the  magnitude  of  the  adjustment.  These  subprocesses 
will  reflect,  in  large  part,  the  judge's  internal  representation  of  the  judgment 
task  and  the  judgment  scale. 

The  Psychological  Nature  of  Algebraic  Processes 

Evidence  supporting  the  algebraic  nature  of  human  judgments  has  been  accumu¬ 
lating  for  more  than  20  years  (cf.  Anderson,  1981).  However,  as  Graesser  and 
Anderson  (1974,  p.  697)  pointed  out,  establishing  an  algebraic  model  for  judgment 
data  is  "only  a  first  step  in  the  analysis  of  the  judgment  process."  Beyond 
this  lies  a  clear  need  to  reach  an  understanding  of  the  cognitive  mechanisms  that 
are  involved  when  people  produce  averages,  products,  or  other  algebraic  forms. 

Algebraic  models  are  ordinarily  expressed  in  what  I  will  call  declarat ive 
form.  By  this  1  mean  that  all  the  variables  that  enter  the  judgment  are  expressed 
along  with  the  algebraic  rule  that  defines  the  relationship  among  these  variables 
in  the  judgment.  No  Indication  is  given,  however,  of  the  procedures  through 
which  the  computation  is  performed.  Thus,  the  model  applies  to  the  final  output 
of  the  judgment  process  rather  than  to  the  stages  through  which  the  judgment  is 
generated . 

For  Illustration,  consider  the  forms  in  which  averaging  models  are  usually 
expressed.  Equation  1  gives  the  averaging  rule  in  absolute  weight  form: 

N 

■Si  ■  -5 - 

T.  w 

i 

i=l 

In  this  form,  the  response  given  to  N  stimuli,  Rjj,  is  seen  as  the  quotient  of  two 
sums;  in  the  numerator,  the  sum  of  the  various  stimulus  scale  values  (s^)  each 
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multiplieiJ  by  their  respective  weiglits  (w^)  ,  and  in  the  denominator,  the  sum 
of  the  weiglits.  (For  simplicity  I  have  Ignored  the  possibility  of  an  initial 
impression.  If  there  were  one,  it  would  be  assigned  weight  Wq  and  scale  value 
Sp  and  the  summation  would  run  from  i  0  to  N.) 

Equation  2  gives  the  same  averaging  rule  in  relative  weight  form: 


!<  ■; 
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This  expression  differs  from  the  absolute  weight  form  primarily  in  that  it  suggests 
that  the  various  scale  values  are  each  multiplied  not  by  their  absolute  weights, 
but  by  their  relative  weights.  These,  in  turn,  are  simply  ratios  of  the  individual 
absolute  weights  to  the  total  absolute  weight.  (Sometimes  the  relative  weight 
form  of  the  averaging  rule  is  written  without  showing  the  normalization  of  the 
absolute  weights,  i.e.,  the  division  by  the  sum  of  the  weights.  This  is  done  by 
noting  outside  the  equation  that  the  sum  of  the  weights  is  1.) 

Equations  1  and  2  are  intended  to  describe  the  output  of  the  judgment  process, 
but  not  the  process  Itself.  In  particular,  the  sequence  of  operations  that  one 
would  use  to  literally  instantiate  computation  of  an  average  via  either  of  these 
equations  is  not  intended  as  a  description  of  the  sequence  of  operations  that 
judges  perform  during  the  judgment  process.  Instead,  the  equations  are  intended 
to  show  (a)  what  quantities  must  be  supposed  to  be  functional  In  the  judge's 
representation  of  the  stimulus  information  and  (b)  how  these  quantities  are  related 
to  one  another  mathematically  in  the  description  of  the  final  judgment. 

A  major  assumption  of  the  present  procedural  theory  is  that  the  judgment  pro¬ 
cess  operates  serially  by  producing  "running"  values  as  stimuli  are  sequentially 
selected  and  integrated  into  the  judgment.  Equation  3  shows  how  an  averaging 
model  can  be  written  to  emphasize  this  serial  integration; 


In  the  equation,  is  the  response  (either  internal  or  external)  after  the  first 
n  stimuli  have  been  integrated.  It  is  shown  as  a  weighted  average  of  the  scale 
value  of  the  new  stimulus  item  (s^)  and  the  old  response,  i.e.,  the  response  value 
after  n-1  stimuli. 


Equation  3  captures  the  serial  aspect  of  judgment,  but  it  does  not  convey 
the  sense  that  the  old  response  is  transformed  to  a  new  response  by  an  adjustment 
process  that  increases  or  decreases  the  value  of  the  old  judgment  in  accord  with 
the  relationship  between  the  value  of  the  old  judgment  and  the  value  of  the  new 
information.  Equation  3  can  be  rewritten,  however,  to  a  change  form  that  captures 
the  notion  of  adjustment  fairly  well: 


R 

n 


w  (s  -R  ,) 
n  n  n-1 


+  R 


'n-1 


(A) 


In  this  form,  the  variable  w^  stands  for  the  relative  weight  of  the  information 
currently  being  Integrated.  Note  that  the  new  judgment,  R^,  is  simply  the  old 
judgment  plus  or  minus  some  proportion  of  the  difference  between  the  value  of  the 
new  information  and  the  value  of  the  old  response. 

Equation  4  comes  much  closer  than  the  previous  equations  to  capturing  in 
algebraic  terms  the  various  psychological  operations  that  are  hypothesized  to 
operate  when  judges  produce  averages.  There  are,  however,  certain  respects  in 
which  even  this  equation  misrepresents  important  psychological  features  of  the 
judgment  process.  To  begin  with.  Equation  4  an  equation,  which  is  to  say  that 
it  shows  how  certain  numerical  (i.e.,  digital)  quantities  are  to  be  operated  on 
arithmetically  to  produce  other  numerical  quantities.  In  contrast,  the  procedural 
theory  outlined  above  need  not  have  its  various  values  instantiated  numerically 
(although  they  could  be).  Likewise,  there  would  need  be  no  literal  subtraction  of 

I 

R  from  s  ,  no  literal  multiplication  of  the  difference  by  w  ,  and  no  literal 
n  “  I  n  ^  n 

add.it  ion  of  the  quantity  to  R 

^  n-1 

'a  t  tie  <econd  place.  Equation  4  does  not  reveal  how  the  characteristics  of 
the  various  stimuli  to  be  integrated  influence  the  order  in  which  they  are  pro¬ 
cessed  or  how,  in  turn,  the  order  of  processing  affects  the  algebraic  form  of  the 
judgment.  This  is  particularly  true  with  respect  to  the  important  ordering  role 
played  by  weights  in  the  procedural  theory.  Information  integration  theory  has 
traditionally  emphasized  the  theoretical  necessity  of  construing  stimuli  in  inte¬ 
gration  tasks  to  have  both  weights  and  scale  values  (cf.  Anderson,  1981,  p.  334- 
355),  but  it  has  not  specified  the  different  functions  that  weights  and  scale 
values  play  in  judgment  except  to  point  out  that  weights  measure,  more  or  less, 
the  salience  or  amount  of  information  in  a  stimulus  whereas  scale  values  measure 
the  position  of  a  stimulus  on  a  particular  scale  of  judgment.  The  present  proce¬ 
dural  theory  amplifies  this  important  distinction  between  weights  and  scale 
values  by  showing  their  functional  differences  in  the  judgment  process. 
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Finally,  Equation  A  does  not  specify  how  tlie  internal  judgment  is  related  to 
the  external  response.  Traditionally,  algebraic  theories  have  treated  the  inte¬ 
gration  stage  (in  which  the  algebraic  mechanism  operates)  as  quite  separate  from 
the  response  stage  in  which  the  judge  interprets  his  internal  judgment  in  the 
explicit  terms  required  by  the  experiment.  Anderson  (cf.  1981,  p.  A-?),  for  exam¬ 
ple,  refers  to  a  "psychoniotor  law"  in  which  a  response  function  operates  to  map 
an  implicit  response  into  an  ob.servable  response.  This  separatiin  >f  internal 
from  external  response,  in  turn,  has  led  some  algebraic  theorists  to  distrust  the 
surface  patterns  of  judgment  data  and  to  suppose  that  the  internal  algebraic  pro¬ 
cess  may  be  quite  different  from  that  suggested  by  the  data  (cf.  Birnbaum,  in  press). 
In  the  prevent  view,  internal  and  external  response  processes  are  brought  closer 
together.  Although  it  is  obvious  that  thoughts  differ  fundamentally  from  their 
observable  manifestations  in  judgment  tasks  (c.g.,  marks  made  on  paper,  movements 
made  on  a  potentiometer,  etc.),  it  will  be  seen  that  there  is  often  a  close 
connection  between  internal  or  Implicit  responses  and  some  of  the  more  superficial 
characteristics  of  the  external  response  scale. 

Adjustment  Processes  and  Algebraic  Form 

In  this  section  I  will  show  how  adjustment  processes  can  be  related  to  three 
algebraic  patterns  (averaging  rules,  relative  ratio  rules,  and  multiplying  rules) 
that  are  commonly  observed  in  judgment  data.  Before  proceeding,  however,  it  is 
useful  to  point  out  some  psychologically  important  differences  between  the  sort  of 
analogical  procedures  that  are  assumed  by  the  present  model  and  more  ordinary 
numerical  procedures  for  computing  algebraic  results. 

For  sake  of  exposition,  suppose  that  a  subject  is  asked  to  average  73  and  A2 . 

One  way  to  proceed  would  be  to  add  73  and  A2  to  yield  115  and  then  divide  the 
result  by  2  to  yield  57.5.  Numerical  processes  such  as  this  are  characterized  by 
three  properties:  (1)  The  numerals  that  are  manipulated  in  the  aritfimetic  are 
symbols  that  have  been  assigned  to  quantities  arbitrarily.  (2)  The  operations 
that  are  applied  to  the  numerals  arc  rewrite  rules  that  specify  the  mapping  between 
the  ordered  strings  of  numerals  in  the  problem  statement  and  the  ordered  string 
of  numerals  in  the  answer.  For  example,  in  adding  73  and  A2,  we  know  that  the  ”3" 
and  the  "2”  in  the  ones  places  of  the  two  addends  comblrie  to  yield  a  "5"  in  the 
ones  place  of  the  answer.  (3)  Quantitative  interpretation  of  the  answer  string 
requires  information  that  Is  not  in  the  string  itself.  That  is,  since  the  answer 
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string  docs  not  represent  quantity  directly,  the  interpreter  or  decoder  of  the 
string  must  apply  outside  knowledge  about  the  mapping  between  strings  of  numerals 
and  the  quantities  they  represent. 

Another  way  to  average  the  numbers  would  be  analogically.  Tills  could  be  done, 
lor  example,  by  locating  the  value  73  on  some  quantitative  continuum,  then  finding 
the  location  of  42  on  the  same  continuum,  and  finally  moving  halfway  between  73 
and  42.  Note  that  none  of  these  operations  would  need  involve  numerals  since 
analogical  processes  use  quantities  (e.g.,  spatial  position,  intensity,  etc.) 
directly  to  represent  other  quantities  (e.g.,  the  average  of  73  and  42).  For  exam¬ 
ple,  if  the  continuum  wore  printed  as  a  line  on  a  piece  of  paper,  the  scale  posi¬ 
tions  corresponding  to  73  and  42  could  be  located  and  folded  to  meet  one  another. 
The  position  of  the  fold  would  then  mark  the  answer  of  57.5. 

There  are  several  good  reasons  to  doubt  that  the  cognitive  processes  that 
mediate  algebr.iic  judgment  arc  numerical.  The  first  is  that  for  many  judgment 
tasks,  neither  the  stimuli  nor  the  responses  may  ever  be  represented  numerically, 
as  would  bo  necessary  for  numerical  computation.  In  impression  formation,  for 
example,  subjects  are  typically  asked  to  read  a  set  of  personality  trait  adjectives 
and  then  rate  (often  on  an  unmarked  continuous  scale)  the  likeableness  of  a  person 
having  the  traits.  In  verbal  tasks  such  as  these  it  is  extremely  unlikely  that 
subjects  would  recode  the  stimulus  information  into  numerical  form  and  then  operate 
on  that.  Instead,  it  appears  that  human  judgment  processes  can  operate  as  readily 
on  quantities  that  are  defined  verbally  or  vaguely  as  it  can  on  quantities  that 
happen  to  be  expressed  numerically. 

A  second  reason  is  that  judgmental  difficulty  does  not  seem  to  increase  as 
quickly  as  a  function  of  problem  size  as  would  be  expected  if  the  judgment  process 
involved  the  sorts  of  symbol  manipulation  that  occur  in  numerical  computation. 

For  example,  Shanteau  (1974)  had  subjects  rate  the  worth  of  single  gambles  (e.g., 
fairly  likely  to  win  sandals)  and  double  gambles  (e.g.,  fairly  likely  to  win 
sandals  and  unlikely  to  win  bicycle).  Even  though  the  double  gambles  were  obvious¬ 
ly  more  complex  than  the  single  gambles,  there  was  no  tendency  for  the  single 
gambles  to  produce  "better"  data.  This  result  contrasts  strongly  with  results 
from  studies  of  mental  arithmetic  (e.g.,  Dansereau  &  Gregg,  1968;  Hitch,  1978)  in 
which  subjects  perform  numerical  computations  without  pencil  and  paper.  In  these 
tasks,  both  errors  and  latency  to  solution  tend  to  increase  as  problem  difficulty 
increases . 


A  third  and  final  reason  is  that  it  is  rare  for  subjects  in  judgment  tasks 


Sine  e 
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to  know  Intellectually  that  their  , judgments  conform  to  .in  algebraic  rule, 
numerical  computation  retjuire.s  explicit  and  rule-based  manipulation  of  symljols, 
it  seems  exceedingly  unlikely  that  subjects  could  be  using  even  a  nicJimentarv  form 
of  numerical  computation  without  having  at  least  some  awareness  of  the  basis  for 
their  judgments. 

Analogical  processes,  on  tiie  other  hand,  can  provide  a  plausible  acccjunt  of 
how  algebraic  judgments  are  produced.  As  will  be  detailed  in  the  sect  ions  immedi¬ 
ately  following,  they  can  operate  via  a  variety  of  serial  adjustment  procedures 
to  produce  averages,  relative  ratios,  products,  and  other  algebraic  results  from 
stimulus  quantities  that  are  expressed  either  exactly  or  inexactly.  In  addition, 
they  are  computationally  efficient,  putting  few  demands  on  short  term  memory  and 
allowing  problems  of  varying  size  to  be  computed  iteratively  simply  by  recycling 
through  the  processes  of  stimulus  selection,  valuation,  and  adjustment  until  all 
(or  sufficient)  stimulus  information  has  been  integrated. 

Averaging 

Of  the  several  algebraic  models  that  have  been  proposed  to  describe  human 
judgment  processes,  the  model  that  is  empirically  most  pervasive  and  theoretically 
best  understood  is  averaging.  Subjects  appear  to  average  in  a  variety  of  judgment 
situations  ranging  from  number  averaging  in  which  averaging  is  the  requested 
response  to  impression  formation  in  which  averaging  is  neither  demanded  nor  di.s- 
allowed  logically  to  Bayesian  inference  in  which  averaging  deviates  both  quanti¬ 
tatively  and  qualitatively  from  what  is  normative. 

The  most  commonly  obtained  form  of  averaging  is  what  is  called  constant 
weighted  averaging.  The  terra  "constant  weighting"  refers  to  the  fact  that,  for 
composite  stimuli  generated  by  factorial  rt)mbination  of  stimulus  elemc’it.s, 
the  weights  associated  with  all  stimuli  within  a  given  factor  are  equal.  Weights 
can,  however,  vary  between  factors. 

In  constructing  a  procedural  theory  for  constant  weighted  r.veraging,  two 
important  empirical  findings  must  be  explained.  The  first  is  the  basic  averaging 
result  which,  simply  put,  refers  to  the  fact  that  averaging  one  quantity  with 
another  always  results  in  a  new  quantity  that  lies  between  the  two  original 
quantities.  Thus,  if  neutral  information  is  averaged  with  information  of  high 
value,  the  neutral  information  lowers  the  average.  On  the  other  hand,  if  the  same 
neutral  information  is  averaged  with  information  of  low  value  it  raises  the 
average.  The  second  finding  is  the  parallel  ism  result  which  refers  to  the 
fact  that  if  judgments  generated  by  a  constant  weighted  averaging  process  are 
plotted  as  a  factorial  graph,  parallelism  will  obtain  between  the  various  rows. 
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Both  averaging  and  parallelism  are  easily  explained  within  the  proposed 
serial  adjustment  theory.  Sticking  for  the  moment  to  the  number  averaging 
example  described  above,  suppose  that  a  subject  is  asked  to  average  the  follow¬ 
ing  set  of  numbers: 

73 

42 

63 

67 

The  first  step  would  be  to  anchor  the  judgment  at  73  (since  it  is  first  on  the 
list)  and  then  adjust  downward  to  a  position  lying  between  73  and  42.  This 
between-ness  property  of  the  interim  response  would,  by  definition,  constitute 
a  generalized  averaging  result.  If  in  addition  the  magnitude  of  the  adjustment 
was  one-half  the  distance  between  73  and  42  (i.e.,  57.5),  the  obtained  average 
would  be  the  simple  arithmetic  average  of  the  two  quantities  and,  since  arithmetic 
averaging  constitutes  a  special  case  of  constant  weighted  averaging,  parallelism 
would  also  hold  in  a  f actorially-def ined  pet  of  such  stimulus  pairs. 

The  next  step  in  the  averaging  procedure  would  be  adjustment  of  the  interim 
response  to  account  for  the  value  63.  For  arithmetic  averaging,  the  magnitude 
of  this  adjustment  would  have  to  be  one-third  the  distance  between  57.5  and  63 
giving  a  new  interim  response  of  59.33.  The  final  adjustment  step  would  follow 
the  same  pattern  with  an  upward  adjustment  one-fourth  the  distance  between  59.33 
and  67.  Note  that  for  arithmetic  averaging  the  adjustment  rule  is,  for  the  nth 
stimulus  in  the  average,  to  adjust  1/n  of  the  distance  between  the  scale  value 
of  the  nth  stimulus  and  the  interim  response  to  the  first  n-1  stimuli. 

Arithmetic  averaging  is  special  in  the  sense  that  even  when  it  is  performed 
analogically,  there  still  seems  to  be  a  strong  numerical  base  to  the  process 
both  in  the  representation  of  the  stimulus  values  and  in  the  fractional  specifica¬ 
tion  of  the  adjustment  constants.  These  numerical  overtones  are,  however,  irre¬ 
levant  in  the  more  general  definition  of  the  averaging  process.  Constant  weighted 
averaging  requires  only  two  things:  first,  that  the  adjustment  process,  however 
it  occurs,  yields  a  value  that  lies  between  the  value  of  the  stimulus  being  inte¬ 
grated  and  the  value  of  the  "old"  response;  and  second,  that  the  adjustment 
constant  for  a  particular  serial  position  be  independent  of  the  content  of  the 
stimulus  Information  at  that  serial  position. 

For  illustration,  Figure  2  shows  how  a  person  might  integrate  a  set  of 
personality  traits  in  an  impression  formation  task  without  resorting  to  numerical 
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Figure  2,  Example  of  how  an  analog  Jiidgment  process  might  opt'rat*  on  i\ou- 
numerlcal  input. 
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processes.  The  stimulus  set  is  showu  at  the  top  of  the  figure.  The  central 
portion  of  the  figure  shows  (a)  in  the  left  column,  fictional  thoughts  occurring 
(luring  judgment,  (b)  in  the  middle  column,  a  designation  of  the  current  judgment 
stage  in  terms  of  the  procedural  model  of  Figure  1,  and  (c)  in  the  right  column, 
a  verbal  expression  of  the  running  impression.  At  the  bottom  of  the  figure  is 
given  a  hypothesized  evaluative  continuum  with  occasional  verbal  labels.  This 
continuum  is  intended  to  represent  a  possible  internal  schema  for  evaluative  terms. 
It  is  not  intended  to  be  taken  literally  as  a  spatial  scale,  nor  is  it  Intended 
that  the  verbal  labels  shown  exhaust  the  verbal  descriptions  that  a  subject  could 
give  of  positions  on  the  continuum. 

In  Step  I,  the  judge  scans  the  stimulus  set.  Since  the  items  are  fairly 
undifferentiated,  the  judge  opts  (in  Step  2)  for  processing  them  in  the  order 
listed.  In  Steps  3  and  4,  then,  the  first  item  is  evaluated  and  used  to  anchor 
the  judgment.  Note  that  in  this  example,  the  judge  began  with  a  neutral  initial 
impression.  Thus,  the  impression  value  generated  at  Step  4  is  somewhat  less 
extreme  than  the  value  of  the  anchor  adjective  taken  alone.  In  Step  5,  the 
subject  selects  the  next  item  for  processing  and,  in  Step  6,  evaluates  it. 

Step  7  represents  the  adjustment  of  the  current  impression  toward  the  new  stimulus 
item.  Note  that  for  constant  weighted  averaging  to  hold,  the  phrase  "quite  a  bit" 
must  be  interpreted  as  a  relational  operator  referring  to  a  constant  proportion 
of  the  difference  between  the  new  and  the  old  value.  In  Step  8,  the  judge  selects 
the  next  (and  final)  stimulus  item.  This  is  then  evaluated  in  Step  9  and  adjusted 
for  in  Step  10  to  produce  a  final  impression  that  is  then  mapped  by  the  subject 
onto  whatever  overt  response  scale  has  been  provided.  Note  that  in  this  example 
I  have  shown  the  second  adjustment  (Step  9)  to  involve  a  smaller  proportion  of 
the  difference  between  the  old  impression  and  the  new  item  than  was  shown  in  St.p  7 
For  anything  approaching  arithmetic  averaging  to  hold,  there  would  need  to  be 
successively  smaller  adjustment  constants  at  each  step  with  values  of  about  a  half, 
a  third,  and  so  forth.  Ordinarily,  however,  averaging  tasks  display  primacy, 
which  Implies  adjustment  constants  that  are  even  smaller  than  this.  (More  will  be 
said  about  primacy  and  recency  effects  in  a  later  section  of  the  paper.) 

As  the  example  illustrates,  averaging  procedures  seem  intuitive  and  natural 
for  the  impression  formation  task,  as  they  would  for  many  other  integration  tasks. 
But  averaging  results  are  also  found  in  some  tasks  where  averaging  seems  Irrational 
A  case  in  point  is  the  Bayesian  inference  task  in  which  subjects  make  judgments 
concerning  the  relative  probability  of  two  Iwpothcses  (HI  and  H2)  based  on  one  or 
more  data  samples  (Dl,  D2 ,  etc.). 


1() 

According  to  Bayes'  theorem,  the  probability  of  HI  given  data  sample  D1  is 
as  follows: 


p(HllDl) 


p(Dl|Hl)p(Hl) _ 

F(D1 I  HI) p (Hi)  +  p (Dl ' H2 ) p (H2) 


(5) 


where  p(Hl)  and  p(H2)  are  the  prior  probabilities  of  the  two  hypotheses  and 
p(HljDl)  is  the  posterior  probability  of  HI.  If  a  second  sample  is  introduced, 
the  process  simply  reiterates  with  the  values  for  p(Hl|Dl)  and  p(H2|Dl) 
replacing  the  old  prior  probabilities.  That  is. 


p(H1Id1S.D2) 


p(d2|hi)p(hi!di) _ 

p(D2lHl)p(Hl|Dl)  +  p(D2|H2)p(H2|d1) 


(6) 


Looked  at  in  terms  of  adjustment  processes,  Bayes'  theorem  constrains  the 
direction  in  which  adjustments  can  occur.  This  is  easiest  to  see  by  simplifying 
and  then  rewriting  Equations  5  and  6.  First,  let  p(Dl] Hl)p(Hl)  be  replaced  by 
the  value  a  and  p(Dl|H2)p(H2)  by  the  value  _b: 


p(Hl|Dl)  =  ^ 


(7) 


Note  that  the  equation  reveals  a  simple  relative  ratio  form.  Now,  we  divide  both 
numerator  and  denominator  by  a  to  give: 

In  the  same  way,  p(H2|Dl)  =  b/ (a+b)  =  l/(l+a/b). 

Consider  how  a  new  sample,  D2,  would  affect  the  outcome.  Let  a'  stand  for 
p(D2|h1)  and  stand  for  p(D2|H2).  Substituting  into  Equation  6  we  obtain 


p(H1|d16.D2) 


-A:  +  b'-\ 

a+b  a+b 


a  a ' 

aa '+bb' 


(9) 


This  can  then  be  simplified  as  follows: 


_ 1 _ 

l+{b/a)(b'/a') 


(10) 
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Focusing,  lor  illustration,  on  HI,  notice  tliut  If  sample  IJ2  favors  HI,  i.e., 
b'/a'<  1,  then  the  value  of  p(Hl|ni&D2)  must  exceed  the  value  of  p(HljDl) 
since  the  denominator  term  becomes  smaller.  Thus,  anytime  a  new  sample  favors 
HI,  adjustments  must  be  toward  Increased  support  for  HI.  Under  averaging, 
however,  this  directional  constraint  need  not  hold.  In  particular,  if  the  current 
degree  of  support  for  HI  is  more  extreme  than  the  degree  of  support  offered  by 
the  new  evidence,  averaging-type  adjustment  will  operate  to  lower  the  degree  of 
support  for  HI . 

A  considerable  body  of  evidence  supports  the  notion  that  subjects  in 
Bayesian  inference  tasks  produce  data  that  are  more  like  averages  than  they  are 
like  the  relative  ratios  demanded  by  the  normative  theory  (Beach,  Wise,  &  Barclay, 
1970;  Marks  &  Clarkson,  1972,  1973;  Shanteau,  1970,  1972).  In  addition,  the 
current  research  program  has  demonstrated  quite  clearly  that  these  averaging-like 
results  can  actually  involve  directionally  inappropriate  adjustments  (Lopes, 

1981,  1982a).  In  other  words,  subjects  who  are  given  new  data  that  support  the 
currently  held  hypothesis  more  than  the  alternative  sometimes  actually  lower  their 
confidence  in  the  supported  hypothesis  and  increase  their  confidence  in  the  alter- 
nat ive . 

If  subjects  always  produced  averages  and  never  produced  relative  ratios, 
there  would  be  little  problem.  One  could  suppose  that  the  psychological  pressure 
to  make  adjustments  toward  the  scale  value  of  the  new  information  was  simply  so 
great  that  logical  factors  such  as  those  influencing  normative  theories  could  not 
compete.  But  subjects  sometimes  do  make  ratio-like  judgments  even  in  tasks  that 
are  formally  quite  similar  to  Bayesian  tasks.  The  question  then  arises  as  to 
the  determinants  of  averaging-like  and  ratio-like  processing.  This  is  an  important 
question,  but  it  is  best  deferred  until  relative  ratio  rules  have  been  discussed. 

Relative  Ratios 

Ratio  tasks  are  tasks  of  relative  judgment.  In  the  simplest  case,  there  are 
two  polar  alternatives  and  a  set  of  stimuli.  The  judge's  task  is  to  locate  the 
values  of  the  various  stimuli  relative  to  the  alternatives.  In  many  experimental 
tasks,  the  stimuli  are  generated  by  a  factorial  combination  of  features  that  vary 
in  the  degree  to  which  they  support  the  two  alternatives. 

As  noted  above,  the  Bayesian  inference  task  is  a  relative  ratio  task  in  which 
the  "features"  that  are  combined  factorially  are  probabilities  associated  with 
different  samples.  That  is,  if  we  look,  for  example,  at  the  three  terms  comprising 
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Equation  '>,  we  see  that  each  of  them  is  separately  the  joint  probability  of  the 
data  and  one  or  the  other  of  the  two  hypotheses. 

Relative  ratio  rules  need  not,  however,  involve  probabilities.  Oden  (1979), 
for  example,  has  used  a  relative  ratio  rule  to  model  the  degree  to  which  people 
judge  various  character-like  stimuli  to  be  instances  of  one  letter  or  another. 

His  model  is  a  fuzzy  logical  model  for  integrating  the  continuous  truth  values  of 
various  semantic  propositions  describing  the  candidate  patterns. 

Figure  3  shows  both  the  data  and  the  stimuli  from  Oden's  experiment:  Note 
that  each  stimulus  matrix  (upper  panel)  involves  two  features  that  vary  continuously 
across  a  factor.  For  example,  in  the  T/F  matrix  at  the  left,  the  row  feature  is 
the  degree  to  whlcli  the  left-hand  portion  of  the  T-bar  is  present  or  absent  and 
the  column  feature  is  the  degree  to  which  the  middle  F-bar  is  present  or  absent. 

In  the  matrix,  perfect  examples  of  F  and  T  are  located  in  the  upper  right  and  lower 
left  cells,  respectively.  The  other  stimuli  are,  to  varying  degrees,  less  than 
perfect  and  the  subject's  task  is  to  say,  for  each  stimulus,  the  degree  to  which 
it  is  a  T  or  an  F. 

Oden's  model  can  be  paraphrased  as  follows:  if  we  suppose  that  the  subject 
is  asked  to  rate  the  degree  to  which  a  given  stimulus  is  a  T  or  an  F  (i.e.,  the 
stimulus  "T"  would  be  rated  1  and  the  stimulus  "F"  would  be  rated  0),  then. 


Relative  T-ness  of  stimulus 


T-ness  of  feature  1 

_ F-ness  of  feature  1 _ 

T-ness  of  feature  1  ^  F-ness  of  feature  2 
F-ness  of  feature  1  T-ness  of  feature  2 


(11) 


A  relative  ratio  rule  of  this  sort  is  expected,  for  stimuli  such  as  Oden  used,  to 
produce  data  that  have  a  "barrel-shape"  when  they  are  graphed .  As  can  be  seen  in 
the  lower  panels  of  Figure  3,  Oden's  data  do,  indeed,  have  such  a  barrel-shape 
for  all  three  stimulus  matrices  and,  thus,  verify  that  people  have  access  to  ratio¬ 
like  judgment  processes. 

The  upper  panel  of  Figure  A,  however,  shows  the  data  pattern  that  is  produced 
by  naive  judges  in  the  Bayesian  task.  These  particular  data  were  taken  from  an 
experiment  in  which  subjects  had  to  judge  the  probability  that  a  machine  was  in 
need  of  repair.  The  stimuli  were  pairs  of  estimates  of  the  rate  at  which  the 
machine  produced  rejected  parts  (hopes,  1981a).  As  is  evident  graphically,  these 
data  are  essentially  parallel  —  indicative  of  an  averaging  process  —  rather  than 
barrel-shaped  as  Bayes'  theorem  requires. 


If  people  can  produce  relative  ration-,  why  don't  they  do  so  In  the  Bayesian 
task?  The  answer  seems  to  lie  in  the  relation  between  the  procedures  underlying 
ratio-like  responding  and  the  subject's  representat ion  of  the  stimulus  Informa¬ 
tion  and  the  response  scale.  There  appear  to  be  two  fundamental  differences 
between  relative  ratio  procedures  and  averaging  procedures.  The  first  of  these 
concerns  the  direction  in  which  adjustments  are  made  when  both  new  and  old  infor¬ 
mation  favor  the  same  hypothesis.  As  was  discussed  above,  adjustments  in  rela¬ 
tive  ratio  processes  must  always  be  made  in  the  direction  of  the  hypothesis  that 
is  favored  by  the  current  information.  In  contrast,  adjustments  in  averaging 
can  be  made  either  towards  or  away  from  the  hypothesis  favored  by  the  current 
information  depending  on  whether  the  current  information  is  more  or  less  extreme 
than  the  value  being  adjusted. 

The  major  purpose  of  the  Bayesian  inference  experiments  described  above  was 
to  find  out  whether  naive  subjects  could  be  made  to  be  better  Bayeslans  by  teach¬ 
ing  them  better  adjustment  strategies.  In  the  first  experiment,  two  groups  of 
subjects  were  given  sample  information  about  the  rate  of  rejected  parts  in  a 
sequential-presentation  format  and  were  asked  to  judge  the  probability  that  the 
machine  that  produced  the  samples  was  broken.  Half  the  subjects  were  given  no 
further  instructions;  these  were  the  control  group.  The  other  subjects  were 
given  instruction  concerning  the  prevalence  of  directional  adjustment  errors  and 
were  taught  the  proper  direction  of  adjustment  for  pairs  in  which  a  "weak" 
sample  favoring  a  given  hypothesis  follows  a  "strong"  sample  favoring  the  same 
hypothesis.  The  results  Indicated  that  the  instructions  did,  indeed,  greatly 
reduce  the  occurrence  of  directional  errors,  but  the  data  produced  were  still 
much  more  like  averages  than  like  relative  ratios. 

The  largest  discrepancies  appeared  to  involve  stimulus  pairs  in  which  one 
sample  favored  one  hypothesis  and  the  other  sample  favored  the  other  hypothesis. 

For  these  pairs,  subjects  appeared  to  give  the  samples  within  serial  position 
relatively  equal  weight  regardless  of  whether  they  were  extreme  or  not.  In  other 
words,  the  subjects  were  behaving  as  required  by  the  parallelism  principle 
described  in  the  section  on  constant  weighted  averaging.  Bayesian  Inference, 
on  the  other  hand,  requires  a  "differential  weighting"  strategy:  Samples  that 
are  extreme  in  one  direction  or  another  are  more  diagnostic  than  nonextreme 
samples  and  must  be  accorded  higher  weight  in  the  final  judgment. 

Experiment  2  (the  results  of  which  are  shown  in  Figure  4)  attempted  to  teach 
subjects  a  procedure  that  would  facilitate  differential  weighting  in  a  simultaneous- 
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presentation  task.  Basically,  this  new  procedure  retained  the  instructions  con¬ 
cerning  the  proper  direction  of  adjustment  and  added  only  the  idea  that  the 
subject  should  always  anchor  the  judgment  at  the  value  given  by  the  more  extreme 
stimulus  and  then  adjust  for  the  value  of  the  less  extreme  stimulus.  The  reason¬ 
ing  was  that,  since  primacy  is  a  commonly  observed  finding  in  judgment,  process¬ 
ing  stimuli  in  order  of  importance  might  be  cine  way  of  producing  differential 
weight ing . 

As  can  he  seen  in  the  lower  panel  of  Figure  A,  the  training  procedure  was 
quite  effective.  In  contrast  to  a  group  of  control  subjects  (top  panel),  the 
trained  subjects  produced  data  that  are  clearly  more  barrel-shaped.  In  fact, 
these  data  were  very  close  to  the  optimal  values  computed  from  Bayes'  theorem. 

It  appears,  then,  that  whether  or  not  subjects  produce  ratio-like  data  or 
averaging-like  data  involves  both  their  intuitions  about  the  direction  in  which 
judgments  should  be  made  and  their  intuitions  about  the  magnitude  of  the  adjust¬ 
ment  constants  (or,  equivalently,  about  the  order  in  which  stimulus  features 
should  be  processed).  To  understand  why  their  intuitions  are  appropriate  in 
some  tasks  but  not  others,  it  is  useful  to  compare  Oden's  task  situation  with 
the  typical  Bayesian  task  situation. 

In  Oden's  task,  the  stimulus  features  seem  to  have  a  bipolar  character. 

For  example,  presence  of  a  moderately  sized  middle  F-bar  in  a  particular  stimulus 
seems  to  be  simultaneously  evidence  for  F-ness  and  evidence  against  T-ness. 
Psychologically  speaking,  we  might  suppose  that  such  features  are  on  a  bipolar 
scale  than  runs  from  positive  to  negative.  In  contrast,  stimuli  in  Bayesian 
tasks  often  do  not  have  this  bipolar  character.  For  example,  suppose  that  the 
two  hypotheses  under  test  involve  whether  the  rejection  rate  for  a  machine  is 
10  parts  per  1000  (HlO/1000)  or  20  parts  per  1000  (H20/1000).  A  sample  with  an 
estimated  rate  of  16  parts  per  thousand  intuitively  favors  H20/1000  at  least 
slightly.  However,  it  does  not  seem  particularly  to  deny  the  possibility  of 
HlO/1000.  Thus,  Bayesian  stimuli  seem  to  lie  on  a  unipolar  scale  that  runs  from 
zero  to  positive. 

Whether  or  not  a  subject  evaluates  stimuli  on  a  bipolar  or  unipolar  scale 
seems,  intuitively,  to  be  an  important  determinant  of  whether  or  not  directional 
errors  occur  in  adjustment.  If,  for  example,  a  subject  has  coded  a  moderately- 
sized  F-bar  as  "F-like  and  not  T-like,"  it  seems  unlikely  that  the  subject  would 
ever  adjust  toward  T-ness.  On  the  other  hand,  if  such  a  mid-range  value  were 
to  be  coded  relative  to  a  neutral  point,  as  appears  to  be  the  case  when  subjects 
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judge  mid-range  samples  in  the  Bayesian  task  to  be  "a  little  bit  on  the 
H20/1000  side  of  neutral",  adjustment  might  easily  be  toward  the  middle  or 
neutral  range  of  the  response  scale,  as  would  be  required  for  averaging. 

The  issue  of  order  of  processing  in  the  two  tasks  also  seems  to  be  quite 
different.  In  Oden's  task,  one  seems  automatically  to  pick  out  the  more 
diagnostic  feature.  Hence  it  would  be  natural  to  use  it  as  an  anchor  and 
adjust  for  the  weaker  feature.  For  example,  the  stimulus 
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would  be  described  by  most  people  as  a  T  with  a  blot  on  the  vertical  bar, 
suggesting  that  the  full  length  left  T-bar  is  more  important  or  salient  than  the 
small  middle  F-bar.  In  judging  the  relative  T-ness  of  the  stimulus,  they  would 
consequently  anchor  on  the  value  given  by  the  T-bar  and  adjust  for  the  value  given 
by  the  F-bar.  Bayesian  stimuli,  in  contrast,  are  typically  perceptually  un¬ 
differentiated,  as  for  example,  in  the  stimulus  pair  below: 

14  rejects 

11  rejects 

Although  the  second  of  the  two  samples  is  more  diagnostic  (given  the  hypotheses 
described  above),  the  eye  is  not  drawn  to  it  and  there  is  little  reason  to  suppose 
that  a  naive  subject  would  have  any  particular  tendency  to  consider  it  more  impor¬ 
tant  or  to  process  it  first.  Thus,  Oden's  task  nay  tend  to  predispose  people  to 
process  the  stimulus  components  in  an  order  that  would  facilitate  the  differential 
weighting  necessary  for  a  relative  ratio  rule.  In  contrast,  the  typical  Bayesian 
task  would  tend  to  predispose  people  to  process  the  stimuli  in  an  order  more 
amenable  to  constant  weighted  averaging. 

Finally,  a  third  factor  that  may  differentiate  between  Oden's  task  and  the 
typical  Bayesian  task  may  be  that,  for  naive  subjects,  the  respense  scale  in 
Bayesian  tasks  is  often  confusable  with  an  estimation  scale.  As  is  evident  in 
the  literature  (Beach,  Wise,  f»  Barclay,  1970;  Marks  &  Clarkson,  1972,  1973; 
Shanteau,  1970,  1972),  judgments  in  Bayesian  tasks  are  not  only  more  often  like 
averages  than  like  relative  ratios,  they  are  also  more  like  averages  of  estimates 
than  they  are  like  averages  of  inferences.  For  example,  in  Lopes  (1981),  It 
appeared  that  subjects  in  a  serial -presentat ion  version  of  the  machine  task  simply 
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treated  the  response  scale  as  running  linearly  Iron  10  rejects  (on  the  left) 
to  20  rejects  (on  the  right)  rather  than  from  p(H20/1000)  =  0  to  p(H20/1000) 

=  1.  In  particular,  their  anchor  judgments  were  proportional  to  the  nur.her 
of  rejects  in  the  anchor  sample  rather  than  being  numerically  at  all  like  the 
normatively  expected  inferences. 

Mult iply ing 

There  are  several  good  examples  of  tasks  in  which  naive  subjects  produce 
judgments  that  look  as  if  they  had  been  generated  by  multiplying.  Prominent  among 
these  are  the  risky  decision  task  in  which  subjects  judge  the  worth  of  gambles 
(Anderson  &  Shanteau,  1970;  Shanteau,  197A;  Tversky,  1967)  and  the  joint  proba¬ 
bility  task  in  which  subjects  judge  the  likelihood  of  joint  events  (Beach  £■ 

Peterson,  1966;  Lopes,  1976;  Shuford,  1959). 

Figure  5  gives  some  data  from  a  joint  probability  task  in  which  experienced 
poker  players  were  asked  to  judge  the  likelihood  that  they  could  beat  pairs  of 
opposing  stud  poker  hands  with  a  pair  of  sevens.  The  stimulus  pairs  were  composed 
using  an  11  x  3  factorial  design  in  which  the  levels  of  the  two  factors  were  poker 
hands,  each  described  by  four  up-cards,  a  bet  amount,  and  the  playing  style  of  the 
opponent.  For  present  purposes  there  is  no  need  to  get  into  the  particular  coding 
used  to  describe  the  hands.  Suffice  it  to  say  that  the  hands  listed  along  the 
abscissa  from  left  to  right  have  decreasing  probability  of  beating  a  pair  of  seve*. 
as  do  the  row  hands  listed  from  bottom  to  top. 

The  experiment  for  which  data  are  shown  contrasted  two  diCi.  t  nt  res,"  r. ,  e 
modes,  a  rating  response  in  the  top  panel  and  a  betting  response  In  the  bottjm 
panel.  As  can  be  seen,  the  data  from  both  conditions  show  the  characteristic 
"fan  shape"  of  a  multiplicative  process  (cf.  Anderson.  1981,  Section  l.A).  This 
result  suggests  that  poker  players  consider  the  likelihood  of  beating  a  pair  of 
opposing  hands  to  be  proportional  to  the  product  of  the  likelihoods  of  beating  the 
single  hands. 

The  question  of  interest  for  present  purposes  is  how  these  data  were  generated. 
Although  poker  players  are  generally  more  familiar  with  probabilistic  notions 
than  nongamblers,  there  did  not  appear  to  be  any  indication  that  they  were  assign¬ 
ing  probabilities  to  the  hands  and  multiplying  them.  Instead,  they  appeared  to 
be  using  an  analog  serial  fractionation  strategy  that  is  mathematically  equivalent 
to  multiplying  (Lopes,  1976;  Lopes  &  Ekberg,  1980)  . 

In  serial  fractionation,  the  Judge  Is  assumed  to  anchor  the  judgment  at  the 


L 


26 


valuf  of  .itio  stiniiilus  and  then  adjust  tfiis  value  downward  (i.e.,  toward  zero  on 
the  scale)  directly  in  proportion  to  the  value  of  another  stimulus.  If  there 
are  more  than  two  stimuli  to  he  integrated,  this  proportional  fractionation 
operation  is  repeated  until  all  stimuli  have  been  processed.  Thus,  serial  frac¬ 
tionation  is  simply  a  special  case  of  anchoring  and  adiustment  in  which  the 
adjustments  are  always  downward  and  always  in  proportion  to  the  value  of  the 
stimulus  being  integrated. 

As  it  happened,  subjects'  use  of  the  serial  fractionation  strategy  was 
directly  observable  in  the  bet  condition  of  the  poker  experiment  described  above. 

The  experiment  was  run  manually  without  computer  assistance.  In  the  task  the 
subjects  were  required  to  bet  between  1  and  30  cents  that  they  could  beat  each 
pair  of  opposing  hands.  The  hands  were  printed  separately  on  4  x  6  cards  and 
were  displayed  simultaneously  on  wooden  stands  that  were  located  side  by  side  in 
front  of  the  subject.  Between  the  subject  and  the  stands  was  a  long  strip  of 
paper  divided  into  30  segments  and  numbered  from  1  on  the  left  to  30  on  the  right. 
The  subject  was  to  Indicate  his  bet  by  sliding  a  response  marker  (i.e.,  a  poker 
chip)  into  the  segment  labeled  with  his  intended  bet  amount. 

A  common  response  pattern  for  subjects  was  the  following.  First,  they  looked 
at  one  of  the  two  stimulus  hands.  (Since  these  were  printed  on  fairly  large 
cards,  it  was  possible  to  know  simply  from  head  movements  where  subjects  were 
looking.)  Then  they  located  a  position  on  the  betting  scale  for  this  first  hand 
using  the  response  marker.  Next  they  looked  at  the  second  stimulus  hand.  And 
last  they  moved  the  response  marker  downward  from  its  initial  position  to  a  final 
position. 

Note  that  this  response  pattern  appears  to  be  an  overt  instantiation  of  serial 
fractionation.  If  it  were  being  done  on  a  probability  rating  scale,  the  result 
would  be  a  normatively  appropriate  response  computed  without  either  numerical  pro¬ 
cesses  or  the  necessity  of  prior  knowledge  of  the  normative  rule  for  joint  proba¬ 
bility.  On  a  bet  scale,  the  response  is  nonoptimal  in  the  sense  that  it  constitutes 
a  form  of  matching  rather  than  maximizing.  However,  the  response  may  bo  sensible 
from  other  perspectives  a.s,  e.g.,  in  reducing  the  variance  of  outcomes. 

Given  subjects'  clear  use  of  the  response  scale  in  generating  tleir  bets, 
a  natural  question  arises  concerning  whether  the  process  can  occur  in  tasks  for 
which  there  is  no  scale  to  be  fractionated  physically.  There  is  evidence  that 
the  answer  Is  yes.  Although  most  of  the  studies  that  have  obtained  multiplicative 
data  have  used  physical  rating  scale.?,  there  are  some  that  have  not.  For  example, 
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in  another  version  of  the  poker  task  nntler  discussion,  subjects  typed  their  bet 
amounts  into  a  computer  console  numerically  and  still  produced  multiplicative 
data  (Lopes,  1S>76,  Lxperiment  3).  Similarly,  Slovic  and  Lichtenstein  (1968, 
reported  in  Shanteau,  1975)  got  multiplicative  data  using  a  verbal  bidding  response. 

Joint  probability  tasks  such  as  the  poker  task  are  a  special  case  for  serial 
fractionation  in  the  sense  that  cither  stimulus  can  be  used  as  the  anchor.  The 
risky  decision  task  differs  from  this  in  that  the  two  parts  of  a  gamble  (e.g., 

1/6  to  win  $13)  lie  on  psychologically  different  scales:  the  prize  or  amount  to 
be  won  or  lost  lies  on  the  value  scale  whereas  the  probability  information  lies 
on  a  probability  scale.  This  differentiation  in  the  informational  content  of  the 
stimulus  items  has  important  consequences  for  the  order  in  which  the  items  are 
likely  to  be  processed. 

As  Lichtenstein  and  Slovic  (1971)  noted  some  time  ago,  subjects  in  risky 
decision  tasks  tend  to  use  the  amount  (or  prize)  to  be  wen  as  an  anchor  and  then 
adjust  this  to  account  for  the  probability  of  winning.  This  value/probability 
ordering  makes  good  intuitive  sense  for  serial  fractionation  since  the  task  is  to 
Integrate  value  and  probability  information  in  such  a  way  that  the  final  response 
lies  on  the  same  value  dimension  as  the  amount  to  be  won. 

For  illustration,  consider  the  data  in  the  upper  panel  of  Figure  6.  In  this 
experiment  Shanteau  (197A)  asked  subjects  to  rate  the  worth  of  verbal  bets 
described  by  phrases  like  "somewhat  unlikely  to  win  sandals."  He  used  a  continuous 
response  scale  that  ran  from  "worthless"  at  the  left  to  "sure  thing  to  win 
television"  at  the  right.  Now  consider  how  a  subject  might  operate  on  this  scale 
to  judge  the  worth  of,  e,g.,  "fairly  likely  to  win  watch."  The  most  obvious  pro¬ 
cedure  would  seem  to  be  to  use  "watch”  as  the  anchor  and  locate  its  value  relative 
to  the  end-marker,  i.e.,  the  television.  This  operation  would  yield  a  quantity 
on  the  value  scale  that  could  then  be  fractionated  proportionately  in  accord  with 
"fairly  likely  to  win."  The  result  would  be  another  quantity  that  is  still  on  the 
value  scale.  More  concretely,  if  we  assume  that,  e.g.,  for  a  given  subject  a 
watch  is  about  40%  as  valuable  as  a  television  and  "fairly  likely  to  win"  is  about 
80%  of  certainty,  then  the  serial  fractionation  process  would  take  the  form 
schematized  in  the  lower  panel  of  Figure  6. 

It  is  important  to  note  that  this  order  of  processing  is  not  a  mathematical 
necessity.  One  could  fractionate  the  response  scale  first  proportionally  to  the 
probability  information  and  then  proportionally  to  the  relative  worth  of  the 
amount  to  be  won.  However,  such  a  probab  11 ity/valuc  ordering  seems  unnatural  at 
best.  Not  only  would  svich  an  ordering  contradict  Lichtenstein  and  Slovic' s  (1971'' 
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observation  that  subjects  anchor  on  the  amount  to  be  wi>n  in  judgment  tasks,  but 
It  would  contradict,  as  well,  the  finding  of  hopes  and  hkberg  (1980)  that  value 
information  appears  to  be  processed  before  probability  information  when  subjects 
have  to  choose  between  gambles  and  sure  things. 

Serial  fractionation  tasks  are  special  in  the  sense  that  all  adjustments  in 
the  process  should  be  downward,  toward  the  zero  point  on  the  scale.  There  are, 
however,  other  potentially  multiplicative  tasks  that  involve  more  general  scalar- 
like  processes  in  which  adjustments  are  not  constrained  to  be  downward.  For 
example,  Anderson  and  Butzin  (1974)  gave  subjects  infonriation  about  the  motivation 
and  ability  of  athletes  and  asked  them  to  predict  their  pciformance  in  a  co  lege 
track  tryout.  They  found  a  multiplicative  relationship  between  motivation  and 
ability.  Oraossor  and  Anderson  (1974)  found  a  similar  relationship 
between  Income  and  generosity  when  subjects  were  asked  to  predict  how  large  a  gift 
a  person  would  give  to  a  worthy  cause. 

In  scalar  tasks  such  as  these  it  is  possible  that  subjects  anchor  their  judg¬ 
ment  on  an  estimate  given  by  one  of  the  quantities  and  then  adjust  this  according 
to  the  other  quantity.  For  example,  it  would  seem  reasonable  to  anchor  at  the 
value  given  by  ability  in  Anderson  and  Butzin's  task  or  at  the  value  given  by  in¬ 
come  in  Graesser  and  Anderson's  task  since  these  quantities  lie  on  the  scale  of 
judgment.  Adjustment  could  then  be  made  for  motivation  and  generosity,  respectively. 
There  are,  however,  some  difficulties  in  imagining  how  scalar  multiplication  would 
work  for  upward  adjustments.  One  possibility  would  be  to  fractionate  out  a 
portion  of  the  anchor  quantity  and  then  add  this  on  so  that  the  final  quantity 
would  be,  e.g.,  one-and-a-half  times  the  original.  In  cases,  however,  where  the 
incrementing  is  by  whole  Integers,  e.g.,  increasing  a  quantity  by  three,  it  is 
not  particularly  intuitive  to  me  that  this  is  done  by  successive  additions  of  the 
anchor  quantity. 

A  question  that  has  arisen  especially  with  respect  to  multiplicative  models 
is  whether  human  judgment  processes  display  the  sorts  of  consistency  and  reversi¬ 
bility  that  characterize  formal  algebra.  In  particular,  there  has  been  consider¬ 
able  interest  in  determining  whether  subjects  have  dividing-like  processes  that 
they  can  apply  when  asked  for  inverse  judgments.  For  example,  if 

Performance  =  Motivation  x  Ability,  (12) 


mathematical  algebra  would  require  inferences  about  ability  given  motivation  and 
performance  to  follow  the  rule 
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Ability  =  Performancc/Motivat ion  .  (13) 

Do  people's  intuitions  about  the  algebra  of  real  world  situations  agree  with 
this  rule?  Put  in  terms  of  processes  Che  question  becomes  not  only  whether  sub¬ 
jects  have  a  procedure  that  is  equivalent  to  analog  division,  but  also  whether 
they  have  a  complete,  appropriate,  and  reversible  mapping  between  the  elements 
and  the  operators  in  their  judgment  system  (i.e.,  values,  adjustments,  etc.)  and 
the  objects  and  operators  of  real  world  task  domains  (i.e.,  people  giving  money 
to  charity  or  trying  out  for  track  meets) . 

Anderson  and  Butzin  (197?i),  Craesser  and  Anderson  (1974)  and  Surber  (1980) 
have  all  Investigated  this  question.  The  general  conclusion  seems  to  be  that 
natural  judgment  processes  do  not  display  the  sorts  of  consistency  and  reversi¬ 
bility  that  characterize  formal  mathematics.  For  example,  in  the  Oraesser  and 
Anderson  experiment,  subjects  appeared  to  combine  income  and  generosity  multi- 
plicatively  to  produce  judgments  of  expected  gift  size,  ^^^1en  judging  income  from 
gift  size  and  generosity,  however,  or  generosity  from  gift  size  and  income,  they 
appeared  to  follow  some  sort  of  subtracting  rule.  Thus,  the  human  judgment  system 
appears  to  comprise  a  set  of  judgmental  heuristics  that  often  correspond  to 
simple  arithmetical  operations  but  that  tend  to  be  both  nonreversible  and  logically 
unrelated . 


Adjustment  Procedures  and  Judgmental  Phenomena 

Research  on  algebraic  judgment  models  has  two  different  aspects.  One  is 
the  investigation  and  description  of  classes  of  integration  rules,  such  as  the 
averaging,  relative  ratio,  and  multiplying  rules  just  discussed.  The  other  is 
the  specification  of  the  conditions  under  which  certain  more  general  judgmental 
phenomena  occur.  Some  of  these  phenomena  represent  classic  topics  in  judgment, 
such  as  primacy/recency,  contrast,  assimilation,  and  halo  effects.  Other  pheno¬ 
mena,  however,  have  come  to  light  only  as  a  re.sult  of  the  power  afforded  by  the 
algebraic  analysis.  These  more  recently  described  phenomena  Include  Initial 
impression  effects,  differential  weighting,  subadditivity,  and  nonadditivity. 

In  this  section  I  will  show  how  four  of  these  phenomena  (primacy/recency, 
initial  impression  effects,  differential  weighting,  and  nonadditivity)  can  be 
understood  in  terms  of  the  procedural  model.  Since  most  of  the  work  on  thc.se 
phenomena  has  been  in  the  context  of  averaging,  1  will  not  consider  the  other 
models  further. 
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Prlmacy/Ki'ioncy 

One  of  the  earliest  findings  in  impression  formation  research  was  a  tendency 
toward  prir.'.acy:  The  earlier  the  position  of  an  adjective  in  a  trait  list,  the 
greater  its  effect  on  the  judgment.  Algebraically,  primacy  effects  consist  of  a 
tendency  for  the  weight  of  the  adjectives  to  be  a  monotonically  decreasing  func¬ 
tion  of  serial  position. 

Several  different  theoretical  interpretations  have  been  offered  for  primacy 
including  inconsistency  discounting,  attention  decrement,  and  a  generalized  form 
of  assimilat ion-to-expectancy  termed  directed  impression  (see  Anderson,  1981, 
section  3.3  for  a  complete  historical  summary  of  this  literature).  These  inter¬ 
pretations  were  tested  in  a  variety  of  ways,  with  the  end  result  being  that  most 
studies  favored  the  attention  decrement  hypothesis. 

One  of  the  experimental  manlpvlatlons  that  proved  to  be  important  conceptually 
Involved  modifying  the  Impression  formation  task  in  a  way  that  forced  subjects  to 
attend  to  information  later  in  the  list.  Two  effective  procedures  were  to  present 
stimuli  serially  and  have  subjects  either  read  all  the  stimulus  items  aloud 
(Anderson,  1968  )  or  make  a  cumulative  response  after  each  stimulus  item  (Stewart, 
1965).  Under  these  conditions,  primacy  was  replaced  by  recency,  thus  confirming 
the  attention  decrement  hypothesis. 

Two  procedural  questions  of  considerable  interest  are  (a)  what  processes 
mediate  attention  decrement  under  conditions  of  ordinary  simultaneous  presentation, 
and  (b)  what  operations  produce  recency  in  the  case  of  cumulative  responding  and 
other  attent ion -demanding  conditions?  For  the  primacy  results,  the  best  answer 
seems  to  be  one  advanced  by  Anderson  (1981,  p.  191),  namely,  that  there  is  an  in¬ 
creasing  tendency  during  judgment  for  the  impression  being  constructed  to  crystal¬ 
lize  and  become  resistant  to  change. 

In  terms  of  adjustment  processes,  the  crystallization  hypothesis  suggests  that 
as  a  direct  function  of  the  cumulative  amount  of  information  already  processed, 
the  serial  adjustment  constants  become  systematically  smaller  than  the  1/n  required 
for  arithmetic  averaging.  This  seems  perfectly  reasonable.  In  addition,  the  pro¬ 
cedural  theory  schematized  in  Figure  1  suggests  another  source  of  primacy,  namely, 
that  subjects  might  partially  or  totally  ignore  later  items  in  a  list  if  they  seem 
relatively  unimportant.  One  way  this  could  work  would  be  for  subjects  having  a 
moderately  well-crystallized  impression  merely  to  scan  later  items.  If  the  scan 
does  not  reveal  any  item  or  items  that  would  be  likely  to  produce  a  large  change 
in  the  impression,  the  subject  might  then  choose  not  tc  adjust  the  impression 
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further.  Siu-h  a  process  woulJ  not  only  be  computationally  efficient,  it  would 
also  capture  procedural ly  the  important  Insight  that  averages  generally  become 
more  and  more  resistant  to  change  as  more  and  more  information  is  included  in  them. 

For  recency  effects,  the  situation  is  somewhat  different.  To  see  how,  con¬ 
sider  once  again  the  number  averaging  task.  Suppose  that  a  subject  is  given  a  list 
of  20  two-digit  integers  one  at  a  time  and  asked  to  give  a  running  average.  As 
noted  previously,  the  adjustment  constants  in  such  a  task  would,  ideally,  begin 
with  1/2  and  then  decrease  to  1/3,  1/4,  etc.  out  to  the  final  value  of  1/20.  A 
few  minutes  play  with  such  a  running  average  should  convince  the  reader  that  the 
adjustments  in  such  a  task  typically  become  very  small  very  quickly. 

Now  consider  the  position  of  a  subject  in  such  a  task  who  is  required  to 
respond  after  each  stimulus.  Sooner  or  later  tlie  subject  will  reach  a  psychological 
"grain  size"  for  adjustments  beyond  which  he  will  not  go;  in  other  words,  at  some 
point  the  subject  will  begin  producing  adjustments  of  relatively  constant  size  that 
track  the  d irect ion  in  which  the  current  adjustment  should  be  made  but  that  greatly 
over-adjust  in  terms  of  magnitude.  This,  by  itself,  would  produce  recency.  Indeed, 
it  is  difficult  to  see  how  a  cumulative  response  requirement  cculd  fail  to  produce 
recency  in  tasks  Involving  more  than  a  few  stimuli.  For  the  condition  in  which 
stimuli  are  merely  pronounced  by  the  subject  but  not  tracked  overtly,  the  "grain 
size"  problem  might  not  hold,  in  which  case  recency  might  reflect  some  other  sort 
of  attentional  effect.  However,  it  is  at  least  plausible  that  the  pronouncing 
condition  does  cause  subjects  to  produce  a  cognitively  d iscriminable  internal  re¬ 
sponse  for  each  stimulus  item,  in  which  case  the  final  result  would  be  essentially 
identical  to  that  for  overt  cumulative  responding. 

Initial  Impression  Effects 

One  of  the  early  controversies  In  the  history  of  impression  formation  research 
concerned  whether  personality  impressions  follow  an  adding  or  an  averaging  rule. 
Although  there  were  several  impressive  results  arguing  for  averaging,  some  re¬ 
searchers  pointed  to  a  common  set-size  effect  as  proof  of  adding  (sec  Anderson, 

1981,  Section  2,  for  a  summary). 

Basically,  the  set-size  effect  was  that  when  subjects  were  given  n  adjectives 
of  Identical  value  and  were  asked  to  rate  their  impressions,  the  impressions  grew 
more  extreme  as  a  function  of  n.  For  example,  if  all  the  adjectives  were  positive, 
the  response  would  be  hlglier  for  three  adjectives  than  for  two.  Obviously 

if  subjects  were  simply  averaging  the  scale  v.-ilues,  the  result  would  nut  differ  for 
different  numbers  of  adjectives. 
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As  it  turned  out,  the  averaging  model  was  able  to  handle  such  set-size  effects 
with  the  assumption  that  the  subject  begins  each  judgment  with  a  neutral  initial 
impression  that  receives  some  small,  but  not  zero,  weight  in  the  averaging  process. 
As  successive  adjectives  are  then  integrated,  the  cumulative  weight  of  the  adjec¬ 
tives  increases  relative  to  the  weight  of  the  initial  impression  with  the  result 
that  the  response  becomes  more  extreme, 

A  considerable  literature  now  exists  supporting  the  initial  impression  con¬ 
struct.  However,  there  are  some  tasks  in  which  there  are  no  initial  impression 
effects  (e.g.,  number  averaging,  psychophysical  averaging,  most  tasks  in  Bayesian 
inference),  so  the  question  may  be  raised  as  to  when  initial  impression  effects 
are  to  be  expected  and  when  they  are  not. 

The  answer  may  reflect  the  subject's  understanding  of  the  judgment  dimension. 

In  the  case  of  impression  formation,  for  example,  the  subject  is  asked  to  rate  the 
overall  likeableness  of  a  person  given  specific  information  about  traits.  It 
seems  self-evident  that  subjects  would  recognize  that  no  single  trait  conveys  all 
the  relevant  information  concerning  a  person's  likeableness.  Thus,  the  tendency 
for  people  to  anchor  at  a  value  somewhat  less  extreme  than  the  scale  value  of  the 
anchor  stimulus  may  represent  a  procedural  means  for  expressing  the  uncertainty 
surrounding  all  the  important  dimensions  not  yet  described.  For  tasks  such  as 
number  averaging,  however,  the  judgment  dimension  is  unidimensional  so  that  there 
is  no  apparent  need  to  "hedge"  in  the  anchoring  process:  the  anchor  is  simply 
set  at  the  value  of  the  first  stimulus  processed. 

A  question  of  some  interest  for  the  procedural  model  is  whether  initial 
impression  effects  are  mediated  by  pre-judgmental  Implicit  anchoring  at  the  initial 
impression  with  subsequent  adjustment  toward  the  first-processed  stimulus  item  or 
whether  they  are  mediated  by  explicit  anchoring  at  the  scale  value  of  the  first- 
processed  stimulus  item  with  subsequent  adjustment  toward  neutral.  (Note  that  the 
latter  process  would  be  conceptually  similar  to  "discounting"  the  anchor  stimulus.) 

One  piece  of  evidence  that  bears  on  this  question  comes  from  a  controversial 
experiment  (Shanteau,  1975)  that  tested  the  averaging  hypothesis  for  inference 
judgment.  In  Bayesian  Inference,  nondiagnostic  (i.e.,  neutral)  information  should 
have  no  weight  whatsoever  in  the  judgment  process.  Shanteau  used  this  fact  in  a 
clever  experiment  to  get  a  qualitative  test  of  his  averaging  model  for  Inference. 

He  presented  subjects  with  various  sequences  of  sample  information  and  asked  them 
for  cumulative  inferences.  He  found  that  subjects  tended  to  adjust  tlieir  inferences 
toward  neutral  after  tlie  presentation  of  nondiagnostic  samples.  This  would  be 


disallowoll,  ot  course,  by  the  Hayo^;i;^n  rule,  but  it  is  cxai  I  1  y  wii.it  one  wm- ■  1 
expect  of  an  averaging  ruli>. 

Wallsten  (l‘l7ba)  objected  to  Shanteau's  analysis  by  pointing  out  that  tin 
averaging  model  only  accovinted  for  the  data  when  the  nond iagnost ic  information 
followed  one  or  more  pieces  of  diagnostic  information.  When  the  nondiagnostic 
sample  was  given  first,  it  appeared  to  have  no  weight  at  all  in  the  judgment. 
Specifically,  the  cumulative  response  to  a  two-item  scries  in  which  a  nond,  i  agnos¬ 
tic  sample  preceded  a  sample  of  given  diagnosticity  was  essentially  identical  to 
the  response  to  a  one-item  series  consisting  only  of  the  diagnostic  sample.  This 
result  is  in  accord  with  Bayes'  theorem  and  not  with  an  averaging  rule. 

These  data  suggest  a  tendency  for  people  to  resist  anchoring  at  a  neiit ra  1 
value  even  when  the  potential  anchor  stimulus  is  an  explicitly  presenteti  stimulus 
item.  If  this  is  so,  it  would  suggest  that  the  explicit  procedure  for  initial 
impression  effects  would  bo  more  plausible  than  the  implicit  procedure  since  the 
latter  would  require  subjects  to  anchor  not  only  at  a  neutral  value,  but  at  an 
Implicit  neutral  value  to  boot.  Obviously,  the  ease  for  explicit  anclioring  is  not 
strong,  especially  as  it  rests  on  generalizing  a  result  from  a  task  in  which 
initial  impression  effects  do  not  ordinarily  occur^  to  a  task  in  which  they  are 
quite  common.  It  seems  plausible,  in  fact,  that  nondiagnosticity  (as  exists  in 
the  inference  domain)  might  be  treated  quite  differently  than  neutrality  (as 
exists  in  the  trait  adjective  domain).  Since  nondiagnosticity  is  literally 
content-less  relative  to  the  issue  at  hand  (i.e.,  which  hypothesis  is  supporied), 
it  may  be  relatively  easily  ignored  when  it  occurs  in  the  first  position.  In 
contrast,  neutrality  (i.e.,  that  people  tend  to  he  averagely  likeable)  has  content 
relative  to  1 Ikeablencss .  Hence,  we  may  bt  willing  to  anchor  on  it  both  when  it 
is  presented  explicitly  and  when  it  merely  exists  as  an  initial  impression. 

Differential  weighting 

As  stated  above,  the  most  common  form  of  averaging  model  is  constant  weighted 
averaging.  Ordinarily,  averaging  models  are  expressed  with  referem-e  to  designs 
in  which  the  various  factors  represent  the  serial  positions  in  which  stimuli  .ippoar 
in  a  temporal  series  or  in  a  printed  list.  The  equation  describing  the  constant 
weighted  averaging  model  for  a  factorial  design  would  be  written. 


>-w,s  . 

1=1  i  ^.1 
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This  shows  ih.it  the  factor  weights,  w^,  vary  only  with  the  factor,  i,  whereas 
the  scale  values,  s ,  vary  with  both  the  factor  and  the  level,  j,  within  factor. 

There  .ire  several  theoretical  and  practical  factors  that  make  it  desirable, 
when  possible,  to  set  up  experiments  so  that  constant  weighting  will  hold.  However, 
the  averaging  rule  can  be  written  more  generally. 


i=l 


(15) 


i=l 


to  allow  weights  to  vary  not  only  between  factors,  but  within  factors  as  well. 

In  this  form  the  averaging  rule  would  bt  able  to  capture  virtually  any  psychological 
factor  that  might  affect  tlie  weight  of  a  stimulus  item,  e.g.,  reliability,  source 
credibility,  novelty,  extremity,  and  so  forth. 

A  form  of  differential  weighting  that  has  appeared  fairly  frequently  in  the 
literature  is  what  has  been  termed  a  "negativity  effect"  (Anderson,  1981,  Section 
4.4.2).  The  observation  is  that  stimulus  weights  seem  to  increase  as  a  function 
of  their  negativity.  In  a  factorial  plot  this  distorts  parallelism  by  causing  a 
convergence  of  the  data  points  for  stimuli  containing  negative  items. 

For  Illustration,  consider  a  stimulus  set  generated  by  factorial  combination 
of  three  sets  of  adjectives: 


Factor  1 


Factor  2 


Factor  3 


careful 

careless 


Intel 1 igent 
gullible 


friendly 

cruel 


The  stimulus  combinations  that  would  result  would  be. 


Intel! igent 

gullible 

intell igent 

gullible 

careful 

careful 

careless 

careless 

friendly 

f  riendly 

f  riendly 

friendly 

in tell igent 

gullible 

intelligent 

gull Ible 

careful 

careful 

careless 

c.arel  ess 

cruel 

cruel 

cruel 

cruel 

For  the  stimuli  in  the  upper  row,  there  are  clear  differences  in  the  scale  values 
of  the  adjectives.  There  may  also  be  differen  -s  in  importance  or  salience. 
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For  exampto,  intel  I  if^enco  and  friendliness  might  he  considered  more  important 
than  carefulness  in  judging  the  1 ikeahleness  of  a  person.  However,  these  parti¬ 
cular  differences  in  weight  would  probably  not  be  great  enough  to  prevent  one 
from  simply  integrating  the  adjectives  in  order  as  they  appear  in  the  stimulus, 
i.e.,  using  the  default  order.  For  the  stimuli  in  the  bottom  row,  however,  the 
situation  is  different.  The  adjective  "cruel"  is  both  extremely  negative  and 
highly  salient.  Thus,  a  common  pattern  would  probably  be  for  subjects  to  judge 
these  stimuli  by  anchoring  on  "cruel"  and  adjusting  for  the  other  two  adjectives. 

Now  consider  the  effect  that  such  an  ordering  difference  would  have  in  the 
analysis  of  the  factorial  design.  Ordinarily,  the  weights  of  the  various  adjec¬ 
tives  would  reflect  their  serial  position  in  the  stimulu.s  so  that  w^^'^w^^w^. 

In  the  present  case,  however,  the  adjective  "cruel,"  which  by  virtue  of  its  serial 
position  has  weight  w^,  is  pushed  to  the  top  of  the  processing  order  where  it 
receives  weight  w^^.  In  contrast,  its  factor-mate,  "friendly,"  is  processed  in 
the  default  order  so  that  it  always  receives  weight  w^ .  Thus,  a  negativity  effect 
would  occur  in  which,  for  example,  there  would  be  less  difference  between  the 
responses  for  careful/intelligent/cruel  and  careless/intelligent /cruel  than  for 
careful/ intelligent/friendly  and  careless/intelligent/f riendly .  (Note  that 
ordering  effects  need  not  be  the  only  source  of  increased  weighting  for  salient 
adjectives.  The  possibility  of  direct  weighting  effects  will  be  discussed  below.) 

The  algebraic  differences  between  Equation  lA  and  Equation  15  arc  very  small, 
amounting  to  no  more  than  a  restriction  on  whether  the  stimuli  within  an  experi¬ 
mental  design  factor  can  vary  on  weight  or  not.  Are  the  conceptual  differences 
equally  small?  In  terras  of  the  serial  adjustment  model,  the  answer  is  yes. 

Subjects  do  not  "have"  a  constant  weighted  averaging  process  and  a  differential 
weighted  averaging  process  that  they  apply  selectively  to  different  tasks. 

Instead,  they  have  a  general  judgment  procedure  that  functionally  produces  averages. 
Whether  those  averages  show  constant  weighting  or  differential  weighting  is  not 
primarily  the  result  of  what  the  subject  does,  but  rather  reflects  how  the  experi¬ 
menter  constructs  and  analyzes  the  stimulus  design. 

This  point  can  be  illustrated  by  consideration  of  an  experitrent  by  Anderson 
and  Lopes  (1974)  in  which  subjects  made  judgments  of  the  occupational  proficiency 
of  various  people  (c.g,,  lawyer  and  plumber)  based  on  information  about  their 
degree  of  reliability  (average  vs.  extreme)  and  their  degrees  (very,  moderately,  or 
not-very)  of  competence  on  skill  factors  (e.g.,  persuasiveness  and  mechanical 
ability).  In  this  experiment  it  was  anticipated  that  the  relative  weight  of  the 
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Figure  7.  Example  of  differential  weighting  in  averaging. 
T.opes,  1974 . 


From  Anderson  and 
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^.•ompetetu■e  factors  would  reflect  the  inportance  of  those  factors  to  the  profession 
being  judged. 

As  can  be  seen  in  figure  7,  this  is  basically  what  happened.  Taking  the  case 
of  persuasiveness,  for  example,  wo  can  see  that  the  slopes  of  the  two  curves  for 
reliability  are  very  steep  and  very  close  together  when  reliability  and  persuasive¬ 
ness  are  combined  for  lawyers.  '.Then  they  are  combined  for  plumbers,  hot;ever,  the 
curves  are  further  apart  and  virtually  flat.  Both  these  results  reflect  the 
increased  weight  assigned  to  persuasiveness  for  lawyers.  Likewise,  the  importance 
of  mechanical  ability  for  plumbers  is  indicated  by  the  steepness  and  closeness 
of  the  curves  for  reliability  when  reliability  and  mechanical  ability  are  combined. 

How  are  these  effects  produced?  According  to  the  serial  adjustment  model 
the  effects  would,  at  least  in  part,  reflect  processing  order.  (A  furtler  source 
of  weighting  effects  will  be  discussed  below.)  For  example,  if  Information  is 
given  about  a  lawyer's  persuasiveness  and  reliability,  the  judgment  would  probably 
be  anchored  on  the  persuasiveness  value  and  adjusted  for  reliability.  Given 
primacy,  we  would,  therefore,  expect  the  weight  for  persuasiveness  to  be  greater 
than  the  weight  for  reliability.  For  plumbers,  however,  the  ancaoring  would  be 
on  reliability  with  adjustment  (if  any  occurred  at  all)  for  pei suasiveness.  The 
result  would  be  greater  weight  for  reliability. 

The  judgments  in  this  experiment  were  expected  to  conform  to  a  constant 
weighted  averaging  rule  and,  with  the  exception  of  a  small  negativity  effect  that 
will  be  discussed  below,  they  did  so.  But  consider  how  the  situation  would  have 
differed  if  the  lawy'er/plumber  designs  had  been  set  up  as  follows: 

Reliability  Factor  Skill  Factor 

extremely  reliable  very  persuasive 

averagely  reliable  very  mechanical 

From  the  subjects'  point  of  view,  the  stimuli  would  be  identical  to  some  of 
those  judged  in  the  previous  design.  Presumably  they  would  process  them  in  the 
same  way.  But  from  the  design  point  of  view,  we  would  say  that  the  subjects  were 
performing  differentially  weighted  averaging  since,  for  a  given  profession,  the 
weights  on  the  two  levels  of  the  skill  factor  would  be  greatly  different.  Thus, 
psychologically  speaking,  there  is  only  one  averaging  process.  It  reveals  itself 
in  different  guises,  however,  depending  on  the  stimulus  design. 

A  final  point  should  be  made  about  weighting  effects  that  are  mediated 
directly  rather  than  by  order  of  processing.  Consider,  again,  the  data  in 
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Figure  7.  Note  that  for  the  four  panels  in  which  there  is  a  joh-relcvant 
adjective  (e.g.,  persuasiveness  for  lawyers,  mechanical  ability  for  plumbers, 
etc.)  there  is  a  tendency  for  the  curves  to  converge  at  the  low  value  of  the 
adjective.  This  is,  of  course,  a  negativity  effect  as  was  described  above. 

But  we  might  wonder  why  It  occurs  in  these  particular  designs  since,  presumably, 
the  skill  information  is  being  processed  first  for  all  three  levels  due  to  its 
relevance.  One  answer  might  be  that  subjects  have,  in  addition  to  ordering 
procedures,  other  procedures  that  modify  the  magnitude  of  the  adjustment  con¬ 
stant  depending  on  the  relative  importance  of  the  stimulus  being  integrated. 

If  this  is  so,  then  the  adjustments  from  "not  very  persuasive"  might  Involve  a 
smaller  adjustment  constant  than,  say,  adjustments  from  "very  persuasive." 

Such  direct  weighting  procedures  seem  feasible,  in  general,  and  they  would  be 
necessary  practically  for  proper  weighting  of  important  information  that  was 
received  late  in  a  temporal  series.  They  would  also  be  necessary  for  proper 
weighting  of  unimportant  Information  occurring  in  short  lists  where  late  order¬ 
ing  alone  would  not  be  able  to  give  them  small  enough  weight. 

Nonaddit ivity 

Constant  weighted  averaging  rules  and  relative  ratio  rules  both  display 
the  mathematical  property  of  ordinal  independence.  This  refers  to  the  fact  that 
when  these  rules  are  used  to  generate  data  for  factorial  combinations  of  stimulus 
values,  the  rank  order  of  data  within  rows  (and  columns)  is  the  same  for  all 
rows  (and  columns).  Multiplying  rules  can  also  display  ordinal  independence  when 
they  are  restricted  to  stimulus  values  which  are  all  positive,  as  would  be  the 
case  for  the  kinds  of  multiplying  models  discussed  in  this  paper  if  we  disallowed 
stimuli  of  zero  value.  Ordinal  Independence  does  not,  however,  hold  in  general 
for  differentially  weighted  averaging.  Although  particular  data  sets  may  display 
the  property,  other  data  sets  may  show  one  or  more  disordinal  interactions  (i.e., 
crossover  interactions!. 

Ordinal  independence  is  Important  for  present  purposes  because  it  provides 
a  way  to  test  for  the  presence  of  averaging  processes  without  relying  on  the 
quantitative  properties  of  the  responses.  In  other  words,  if  violations  of 
ordinal  independence  can  be  found  within  a  data  set,  then  it  is  possible  to  con¬ 
clude  positively  (within  the  limits  of  uncertainty  imposed  by  unreliability  in 
the  data)  that  the  rule  that  produced  the  data  is  an  averaging  process  and  not 
some  other  additive  rule  operating  with  a  response  transformation  that  mimics 
averaging  results. 
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Tests  et  additivity  have  an  Important  place  in  testing  whether  people  in 
Bayesian  inference  tasks  are  relying  on  processes  that  are  more  averaging-like 
or  more  ratio- like.  Two  experiments  were  run  in  the  present  project  (Topes, 

1982b)  that  attempted  to  manipulate  weighting  factors  in  such  a  way  that  differen¬ 
tial  weighting  would  be  obtained  in  the  machine  maintenance  task  that  has  been 
discussed  before.  The  major  change  in  the  new  task  was  to  designate  samples  not 
only  In  terms  of  the  number  of  rejected  parts  that  they  estimated,  but  also  in 
terms  of  the  reliability  of  the  estimate. 

The  results  of  the  experiments  were  somewhat  equivocal.  At  the  group  level, 
there  were  no  violations  of  additivity.  At  the  single  subject  level,  however, 
there  were  numerous  violations,  suggesting  that  the  judgment  process  was  ordinally 
distinct  from  a  relative  ratio  rule.  Unfortunately,  the  forms  of  nonadditivity 
that  were  obtained  included  not  only  the  particular  type  of  crossover  interaction 
that  would  be  expected  in  differentially  weighted  averaging,  but  two  other  types 
of  interaction  as  well.  Of  these  unexpected  crossover  types,  one  was  fairly 
easily  accounted  for  in  terms  of  the  direction  in  which  adjustments  were  made. 

The  other  one,  however,  did  not  lend  itself  to  ready  explanation,  thus  leaving 
open  the  possibility  that  the  process  might  be  more  complex  than  heretofore 
suspected . 

Since  nonadditivity  is  an  Important  topic,  it  is  worthwhile  showing  how  it 
occurs  in  a  serial  adjustment  process.  For  this  purpose,  I  will  use  data  from 
an  experiment  testing  an  averaging  model  for  similarity  judgment  (Lopes  &  Oden, 
1980) .  In  the  experiment,  subjects  were  given  pairs  of  kinship  terms  such  as 
mother/aunt,  grandfather/glrl-cousin,  brother/sister,  and  so  forth,  and  were  asked 
to  rate  the  similarity  between  the  two  terms.  The  model  being  tested  was  that 
similarity  would  vary  inversely  with  a  weighted  average  of  the  stimulus  differences 
on  each  of  a  set  of  1  relevant  dimensions  or  attributes; 


^iab*^iab 


Slmilarity(a,b)  =  o  - 


In  this  equation,  are  the  weights,  are  the  differences,  and  ti  is  a 

scaling  constant.  The  dimensions  that  were  actually  used  in  the  model  analysis 
were  sex,  age,  and  a  lineality  dimension  termed  "immediacy."  How  immediacy  was 
decided  upon  need  not  concern  us  here.  Suffice  it  to  say  that  immediacy  is  an 


Index  of  o’oseness  of  kin  as  measured  from  the  viewpoint  of  ego,  l.e.,  the  judge 
in  the  task.  It  runs,  from  closest  kin  to  most  distant  kin,  as  follows:  parent, 
sibling,  granoj)  I  rent ,  auntuncle,  great  auntuncle,  and  cousin. 

In  the  experiment,  there  were  several  dramatic  crossover  interactions  that 
appeared  in  both  the  single  subject  data  and  the  group  data.  Two  of  these  are 
shown  in  Figure  8.  Taking  the  crossover  on  the  left  as  an  example,  notice  that 
uncles  and  girl-cousins  (U/GC)  are  considered  to  be  more  similar  than  fathers  and 
girl  cousins  (F/GC) .  This  makes  intuitive  sense:  Although  both  pairs  have  the 
same  difference  on  the  sex  dimension  and  on  the  age  dimension,  uncles  and  girl- 
cousins  are  closer  on  the  immediacy  dimension  than  fathers  and  girl-cousins. 

A  reversal,  however,  occurs  for  greatuncles  and  girl-cousins  versus  grandfathers 
and  girl-cousins.  This  is  apparent  when  one  considers  that  for  these  pairs  also 
the  sex  differences  and  age  differences  are  the  same,  and  only  the  Immediacy 
difference  varies.  In  this  case,  however,  the  pair  with  the  larger  immediacy 
difference,  grandfather  and  girl-cousin  (GF/GC) ,  is  seen  as  more  similar  than 
the  pair  with  the  smaller  immediacy  difference,  greatuncle  and  girl-cousin  (GU/GC) . 

How  do  unintuitive  patterns  such  as  this  occur?  It  is  useful  to  think  about 
the  process  in  terms  of  the  serial  adjustment  model.  Table  1  gives  some  reasonable 
weights  and  scale  values  that  might  be  applicable  in  the  present  case.  Note  that 
the  weights  tend  to  decrease  as  a  function  of  the  magnitude  (or  salience)  of  the 
difference.  (In  the  full  analysis  of  the  experiment,  the  weighting  function  was 
actually  curvilinear  with  large  weights  being  assigned  both  to  large  differences 
and  to  zero  differences  as  when  both  terms  had  the  same  sex  or  the  same  age. 

Since  in  the  present  example  there  are  no  identities,  this  complication  can  be 
safely  ignored.) 

Table  2  shows  how  these  weights  and  scale  values  would  operate  in  a  serial 
adjustment  process  of  the  sort  described  algebraically  by  Equation  4.  If  one 
reads  across  the  columns  and  then  down  the  rows,  the  sequential  anchoring  and 
adjustment  operations  are  revealed.  For  example,  for  the  stimulus  pair  GU/GC,  the 
largest  dimensional  difference  is  for  sex,  with  a  scale  value  of  95.  Thus,  the 
anchor  is  set  to  95.  This  is  adjusted  next  by  the  value  for  age,  which  is  the  next 
largest  difference  (l.e.,  90),  The  adjustment  process  proceeds  by  noting  the 
discrepancy  between  the  old  value  (95)  and  the  new  value  (90)  and  adjusting  down¬ 
ward  about  half-way  due  to  the  relatively  equal  saliences  of  the  two  items.  The 
final  adjustment  is  for  the  very  small  immediacy  difference  (10).  Although  the 
discrepancy  between  the  old  value  (92.52)  and  the  new  value  (10)  is  quite  large 


Table  1 


Hypothetical  Weights  and  Scale  V'alues 
for  Crossover  Interaction 


DIMENSION  AND 

SCALE  VALUE 

WEIGHT  OF 

DIFFERENCE 

OF  DIFFERENCE 

DIFFERENCE 

SEX 

male  vs.  female 

95 

51.0 

AGE 

young  vs.  old 

90 

50.0 

young  vs.  middle 

40 

7.5 

IMMEDIACY 

cousin  vs.  parent 

90 

50.0 

cousin  vs.  grandparent 

70 

40.0 

cousin  vs.  auntuncle 

30 

5.0 

cousin  vs.  great  auntuncle 


10 


1.0 


Steps  in  Serial  Adjustment  Model  Producing  Crossover  Interaction 
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(-82.52),  the  adjustment  is  very  small  since  this  small  difference  in  Immediacy 
is  not  very  salient  compared  to  the  other  two  differences. 

Similar  processes  occur  for  the  other  three  values.  In  every  case  the 
Information  is  processed  in  order  of  the  magnitude  of  the  dimensional  difference 
while  the  magnitude  of  the  adjustment  constant  (i.e.,  the  weights  in  Table  1) 
reflect  the  salience  of  that  difference.  The  result  is  a  set  of  similarity  judg¬ 
ments  that  violate  ordinal  Independence,  thus  testifying  to  the  differential 
weighting  in  the  process. 

Note  that  in  this  example,  there  would  have  been  no  mathematical  reason  to 
require  the  order  of  processing  to  proceed  according  to  salience.  The  non¬ 
additivity  comes  directly  from  the  differential  weighting  and,  hence,  would  appear 
for  any  serial  Instantiation  of  Equation  4.  However,  to  the  degree  that  people 
tend  to  process  information  in  an  order  that  runs  from  most  important  to  least 
important  when  they  can  (i.e.,  taking  first  things  first),  primacy  effects  would 
tend  to  enhance  whatever  differential  weighting  was  already  present  and,  hence, 
would  be  expected  to  encourage  nonadditivity. 

Epilogue 

In  this  paper,  I  have  attempted  to  integrate  some  hypotheses  about  the 
procedures  that  people  use  during  judgment  with  the  vast  literature  already 
extant  on  the  algebraic  properties  of  judgment  data.  I  have  not  supposed  that 
these  procedural  hypotheses  in  any  way  replace  or  deny  the  earlier  algebraic 
work  since  that  work  provided  the  empirical  foundation  that  made  it  reasonable 
to  look  for  the  specific  psychological  operations  for  computing  various  sorts 
of  algebraic  results.  Instead,  I  have  seen  this  work  as  complementing  the 
algebraic  work  by  suggesting  the  fine  structure  of  the  processes  described  by 
the  algebra . 

All  of  the  research  performed  on  the  project  for  which  this  is  the  final 
report  concerned  the  Bayesian  inference  task.  Although  no  one  would  doubt  that 
Inference  is  an  important  psychological  skill,  many  might  judge  that  the  Bayesian 
task  was  played  out  psychologically  long  years  ago  when  so  much  research  was 
generated  on  the  topic  of  conservatism.  I  would  hope,  however,  that  the  present 
research  has  contributed  some  new  Information  about  the  procedures  that  mediate 
the  appearance  of  normatlvely  inappropriate  averaging-like  processes  in  Bayesian 
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The  three  separate  experimental  approaches  taken  to  the  topic  in  this  re¬ 
search  have  each  contributed  differently  to  the  project.  The  first  two  experi¬ 
ments  were  primarily  aimed  at  (and  succeeded  in)  confirming  the  existence  of 
the  kinds  of  directional  errors  of  adjustment  that  would  be  expected  from  an 
averaging  process  (Lopes,  1981).  The  second  two  experiments  provided  a  practical 
means  to  test  the  procedural  analysis  by  seeing  whether  it  could  be  used  to 
engineer  "better  Bayesians"  out  of  naive  subjects  (Lopes,  1982a).  The  result 
was  a  clear  yes.  The  last  two  experiments  attempted,  but  did  not  entirely 
succeed,  in  demonstrating  the  qualitative  necessity  of  an  averaging  formulation 
for  describing  the  judgment  process  (Lopes,  1982b).  Although  there  were  data 
from  some  subjects  to  support  the  hypothesized  nonadditivity,  data  from  other 
subjects  suggested  that  the  process  may  be  more  complicated  than  supposed, 
particularly  when  stimulus  samples  differ  in  reliability. 

Of  these  three  subprojects,  the  one  that  pleased  me  the  most  was  the  one  on 
building  better  Bayesians  —  the  deblaslng  project.  Perhaps  this  was  merely  be¬ 
cause  the  project  succeeded  in  doing  what  most  other  approaches  to  debiasing 
inference  have  failed  at.  Everyone  loves  success.  But  I  think  the  deeper  reason 
is  that  the  deblaslng  effort  was  accomplished  by  decomposing  the  judgment  situa¬ 
tion  in  psychological  terms  and  then  communicating  to  subjects  in  those  same 
psychological  terms  what  errors  they  might  produce  in  their  judgments  and  what 
procedures  they  might  use  to  avoid  error.  It  was  reinforcing,  indeed,  to  see 
a  cognitive  anal'-sis  do  what  other,  more  nearly  "black  box"  analyses  had  failed 
to  accomplish. 

Quantitative  judgment  is  both  commonplace  and  fundamental;  it  underlies  our 
most  basic  thoughts  concerning  the  quantitative  relationships  among  the  complex 
objects  and  situations  in  our  social  and  physical  environments.  At  the  present 
time  a  great  deal  is  known  about  the  content  and  algebraic  structure  of  judgment 
data.  Little  is  known,  however,  about  the  cognitive  mechanisms  that  generate 
such  data.  I  hope  the  present  research  will  move  us  a  little  closer  to  a  proce¬ 
dural  understanding  of  the  human  judgment  system. 
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A9 


Footnote 

^Thero  wore  some  noraddlt Ivlt les  In  the  Bayesian  tasks  used  in  hopes 
(1982b)  that  seemed  to  require  something  like  an  initial  impression  for  proper 
interpretation.  These,  generally,  were  hypothesized  to  involve  a  weaker 
anchor  value  (i.e.,  nearer  neutral)  for  unreliable  samples  than  for  reliable 
samples.  This  would  be  consistent  with  an  initial  impression  hypothesis  in 
which  the  unreliable  samples  receive  less  weight  than  reliable  samples. 
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